Bioinformatics analysis of the structure - function relationship of glutamate dehydrogenase

Préambule

On peut considérer la première réaction d'assimilation de l'azote (sous forme d'ammoniac) par la glutamate déshydrogénase (GDH) comme un point d'entrée dans le métabolisme azoté. L'atome d'azote est à l'origine de la fonction a-aminée des acides aminés selon la réaction :

NH3+ + α-cétoglutarate + NAD(P)H + H+ <======> glutamate + NAD(P)+

Il existe trois isoformes de GDH :

  • la GDH EC 1.4.1.2 qui catalyse la réaction dans le sens de la désamination essentiellement
  • la GDH EC 1.4.1.3 qui catalyse la réaction dans les deux sens
  • la GDH EC 1.4.1.4 (GDH4) qui catalyse la réaction dans le sens de formation du glutamate

La GDH4 joue peut-être un rôle clé dans l'assimilation de l'azote. Or ce rôle n'a pas encore été démontré, notamment chez les plantes. Par ailleurs, on ne dispose d'aucune information concernant la structure de la GDH4.

La bioinformatique permet l'étude prospective de la relation structure - fonction de la GDH.

 

 

In term of taxonomy, the closest organism to higher plants (Eukaryota, Viridiplantae, Streptophyta) from which sequences of glutamate dehydrogenase EC 1.4.1.4 are known is an algae, Chlorella sorokiniana (Eukaryota, Viridiplantae, Chlorophyta).

Thus, the two sequences of this isoform from Chlorella sorokiniana were taken as the reference (Ref) subset in this study.

 

a. Overview of the search for amino acid sequences of the 3 isoforms of GDH

  • 1. Go to the NCBI
  • 2. Choose "ENTREZ"
  • 3. Tape : "glutamate dehydrogenase OR GDH": 12293 hits (10/12/2010)
  • 4. Click on : "Protein: sequence database"
  • 5. Choose the option "Preview/Index" . This allows the use of key-words with the option : "Add Term(s) to Query or View Index"and a boolean logical search ("AND" / "OR" / "NOT")

The number of hits is indicated on the right of the screen. To see the results ("Summary"), click on this number in your browser.

See the results

 

b. Classification of the 116 selected full GDHs amino acid sequences

116 non-redundant full GDH sequences were obtained from 83 organisms representing the three domains, Archaea, Bacteria and Eukaryota. These sequences were classified in 15 different subsets using the following criteria :

  • the EC number
  • the length of the polypeptide chain
  • their Viridiplantae (higher plant) belonging or not

The following table gives the number of amino acid sequences of GDH for each subset (letter).

The length range of the polypeptide chains are : L1 = [411 : 470] - L2 = [503 : 558] - L3 = [1029 : 1106] - L4 = [1607 : 1651]

EC

number

Viridiplantae
 

not Viridiplantae

total

L1
L2
L3
L4

L1

L2

L3

L4

1.4.1.2

9 (A)

     

7 (B)

   

1 (C)

17

1.4.1.3

1 (D)

     

6 (E)

7 (F)

   

14

1.4.1.4

 

2 (Ref)

   

15 (G)

     

17

not

classified

6 (H)

     

13 (I1)

18 (I2)

14 (I3)

5 (J)

5 (K)

7 (L)

68

total

16

2

   

73

12

5

8

116

The sequences of each subset were further aligned to obtain the 15 full consensus amino acids sequences.

c. Comparison of the 15 full consensus GDHs sequences

a. Unzip (untar) the text file containing the 15 full consensus sequences in FASTA format :   

FullConsSeq.zip              FullConsSeq.tar

b. Open the file with a text editor. Copy only the data begenning with a ">".

c. Go to ClustalW (EBI). Paste the data into the window.

d. Select the appropriate matrix and parameters (below) and run the software.

  • Matrix : Gonnet
  • Gapopen : 1
  • Gapext : 1
  • Other parameters : default value

e. The results are returned : ".aln" is the alignment.

 

Result

The schematic alignment of the full consensus sequences shows that GDH subunit is constituted of two or three regions :

  • the N-terminal extension
  • a common pattern to all consensus sequences corresponding to the central domain that contains the substrate and the nucleotide binding sites
  • for large GDH (subsets C, K and L), the C-terminal extension

The length indicated includes the gaps generated by the alignment.

 

d. Analysis of the central domain of GDH : the dinucleotide-binding motif

A bab fold is found in the NAD(P)H-binding subdomain (β7 - α8 - β8).

This Rossmann fold begins with the motif G313AGNVA318 in the case of Ref.

However, the alignment indicates that the actual motives could be more complex.

Such a higher complexity of the signature for the NAD(P)H-binding motif allows to discriminate more precisely the three isoforms.

 

This figure was generated using the software ESpript.

 

  • Secondary structures indicated above the alignment were generated using as the template the bovine GDH3 complexed with NADPH and Glu (PDB # 1HWZ).
  • Amino acid position indicated above the alignments is that of Ref (blue sequence).
  • Plain red vertical boxes : amino acids identical for all consensus subsequences.
  • Open red vertical boxes : amino acids whose homology between all consensus subsequences was greater than 60%.
  • The letter "X" accounts for an amino acid whose identity level was less than 60% after the first alignment of full consensus sequences.
  • The NAD(P)H-binding motif G313AGNVA318 (Ref) is indicated at the bottom of the frame with red circles.

 

e. Search of a second NAD(P)H-binding site

Aldehyde dehydrogenase from Vibrio harveyi is one of the most NADP-specific.

The alignment of GDH from Ref and aldehyde DH shows that :

- there are three putative key residues for the binding of NADP(H) in Ref : Lys202, Ser205 (triangles) and Arg248 (asterisk)

- the NAD(P)H-binding motif G229SVGGG234 of aldehyde DH is aligned with the motif G266VLTGKG272 of Ref (open circles)

Therefore, the latter is likely a second nucleotide-binding motif specific of GDH4.

This figure was generated using the software ESpript.

 

f. Modelisation of the dinucleotide-binding motives and key residues of GDH4 with NADPH (NDP562) and Glu

A theoretical 3D structure of GDH4 from Ref was generated with the homology-modeling program ESyPred3D using as the template the structure of bovine GDH3 (PDB # 1HWZ).

The modelisation and the drawing of a putative structure of GDH4 was performed with the protein structure homology-modeling program DeepView (SwissPdb-Viewer v. 3.7).

Some interactions (plain lines) between the motif G313AGNVA318 or key residues and the coenzyme are indicated :

  • NDP562AO3 - Gly313CA
  • NDP562AO1 - Asn316ND2
  • NDP562AO1 - Val317N
  • NDP562AO2 - Gly244N
  • NDP562NC4 - Thr285OG1

 

 

 

The distances between the protonated carbon atom of the nicotinamide moiety (NDP562NC4) are too long for direct interactions with the motif G313AGNVA318.

However, this motif is stabilized by an internal H-bond Gly315O - Ala318N (dotted line).

Two distances (Glu557OE2 - Lys166NZ and Glu557O - Lys190NZ) are compatible with H-bond interactions between the enzyme and Glu.

The position of the motif G266VLTGKG272 is shown with the potential H-bond Lys166NZ - Thr269OG1.

 

ESpript : Gouet, P., Courcelle, E., Stuart, D. I. & Metoz, F. (1999) "ESPript : analysis of multiple sequence alignments in PostScript" Bioinformatics 15, 305 - 308

ESyPred3D : Lambert, C., Leonard, N., De Bolle, X. & Depiereux, E. (2002) "ESyPred3D : Prediction of proteins 3D structures" Bioinformatics 18, 1250 - 1256

Swiss-PdbViewer : Guex, N. & Peitsch M. C. (1997) "SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling" Electrophoresis 18, 2714 - 2723