|
|
Example of a FASTA record |
|
>gi|22777494|dbj|BAC13766.1| glutamate dehydrogenase [Oceanobacillus iheyensis] MVADKAADSSNVNQENMDVLNTTQTIIKSALDKLGYPEEVFELLKEPMRILTVRIPVRMDDGNVKVFTGY RAQHNDAVGPTKGGIRFHPNVTETEVKALSIWMSLKSGIVDLPYGGAKGGIICDPREMSFRELEALSRGY VRAVSQIVGPTKDIPAPDVFTNSQIMAWMMDEYSKIDEFNNPGFITGKPIVLGGSHGRESATAKGVTIVL NEAAKKKGIDIKGARVVIQGFGNAGSFLAKFLHDAGAKVVAISDAYGALYDPEGLDIDYLLDRRDSFGTV TKLFNNTISNDALFELDCDIIVPAAVENQITRENAHNIKASIVVEAANGPTTMEATKILTERDILIVPDV LASAGGVTVSYFEWVQNNQGFYWSEEEIDNKLHEIMIKSFNNIYNMSKTRRIDMRLAAYMVGVRKMAEAS 1. With the FASTA format, a single file can contain several records (sequences). Each record begins with ">". 2. gi|22777494 : the GenInfo Identifier number is the sequence identification number for a protein or a nucleotide sequence. If a sequence changes in any way, a new GI number will be assigned. 3. dbj|BAC13766.1| : one record could exist in different databases and may have many identifiers. The table gives the explanation of database name and identifier syntax. In this example, this record exists in the DNA Database of Japan under dbj|BAC13766.1. 4. dbj|BAC13766.1| : Database sequence identifiers run parallel to the new accession version system as sequence identifiers. In this example, the ".1" indicates that the sequence has been revised one time. 5. glutamate dehydrogenase [Oceanobacillus iheyensis] : description of the sequence. In this example, "glutamate dehydrogenase" is the name of the protein and Oceanobacillus iheyensis the organism from which it has been determined. |
|