Example of a FASTA record

>gi|22777494|dbj|BAC13766.1| glutamate dehydrogenase [Oceanobacillus iheyensis]

MVADKAADSSNVNQENMDVLNTTQTIIKSALDKLGYPEEVFELLKEPMRILTVRIPVRMDDGNVKVFTGY
RAQHNDAVGPTKGGIRFHPNVTETEVKALSIWMSLKSGIVDLPYGGAKGGIICDPREMSFRELEALSRGY
VRAVSQIVGPTKDIPAPDVFTNSQIMAWMMDEYSKIDEFNNPGFITGKPIVLGGSHGRESATAKGVTIVL
NEAAKKKGIDIKGARVVIQGFGNAGSFLAKFLHDAGAKVVAISDAYGALYDPEGLDIDYLLDRRDSFGTV
TKLFNNTISNDALFELDCDIIVPAAVENQITRENAHNIKASIVVEAANGPTTMEATKILTERDILIVPDV
LASAGGVTVSYFEWVQNNQGFYWSEEEIDNKLHEIMIKSFNNIYNMSKTRRIDMRLAAYMVGVRKMAEAS

1. With the FASTA format, a single file can contain several records (sequences). Each record begins with ">".

2. gi|22777494 : the GenInfo Identifier number is the sequence identification number for a protein or a nucleotide sequence. If a sequence changes in any way, a new GI number will be assigned.

3. dbj|BAC13766.1| : one record could exist in different databases and may have many identifiers. The table gives the explanation of database name and identifier syntax. In this example, this record exists in the DNA Database of Japan under dbj|BAC13766.1.

4. dbj|BAC13766.1| : Database sequence identifiers run parallel to the new accession version system as sequence identifiers. In this example, the ".1" indicates that the sequence has been revised one time.

5. glutamate dehydrogenase [Oceanobacillus iheyensis] : description of the sequence. In this example, "glutamate dehydrogenase" is the name of the protein and Oceanobacillus iheyensis the organism from which it has been determined.

 

Database Name
Identifier syntax
GenBank
gb|accession|locus
EMBL Data Library
emb|accession|locus
DDBJ, DNA Database of Japan
dbj|accession|locus
NBRF PIR
pir||entry
SWISS-PROT
sp|accession|entry name
Brookhaven Protein Data Bank (PDB)
pdb|entry|chain
NCBI Reference Sequence
ref|accession|locus
Protein Research Foundation
prf||name
Local Sequence identifier
lcl|identifier
GenInfo Backbone Id
bbs|number
General database identifier
gnl|database|identifier
Patents
pat|country|number