Kinetoplast Kristmas

Al Ivens alicat at sanger.ac.uk
Sat Dec 16 10:23:52 BRST 2000


Dear Colleagues,

The PSU at the Sanger Centre are happy to announce the deposition of a
large wudge of kinetoplast DNA sequence in the EMBL database.

***  T. brucei  ***
T. brucei home page: http://www.sanger.ac.uk/Projects/T_brucei/
T. brucei Genome Network home page: http://parsun1.path.cam.ac.uk/

43,196 new GSS sequences were submitted to EMBL late this week, and
should be available on Monday for public access
(http://www.ebi.ac.uk/Databases/index.html).  Access from GenBank should
be possible within about a week
(http://www.ncbi.nlm.nih.gov/Entrez/nucleotide.html).  

However, for speedy access right now, a gzipped fasta database of these
sequences is available from our ftp site:

ftp://ftp.sanger.ac.uk/pub/databases/T.brucei_sequences/GSS/
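
For anyone who wants to look through the gzipped fasta file before the
sequences reach the blastserver, a minimal Python sketch is given below.
It is not part of the PSU release: the function name read_fasta_gz and the
file name in the usage comment are hypothetical, and it assumes a plain
fasta layout (a ">" header line followed by one or more sequence lines).

```python
import gzip
from typing import Iterator, Tuple


def read_fasta_gz(path: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from a gzip-compressed fasta file."""
    header = None
    chunks = []
    # "rt" makes gzip decompress and decode to text on the fly.
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                # A new record starts; emit the previous one first.
                if header is not None:
                    yield header, "".join(chunks)
                header = line[1:]
                chunks = []
            else:
                chunks.append(line)
    # Emit the last record in the file, if any.
    if header is not None:
        yield header, "".join(chunks)


# Usage (hypothetical file name, substitute the file fetched from the ftp site):
#   for name, seq in read_fasta_gz("gss_sequences.fasta.gz"):
#       print(name, len(seq))
```

Reading record by record keeps memory use low, which may matter for a
database of this size.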

A list of accessions and IDs is also provided on the ftp site.  The GSS
have not yet been placed in the T. brucei blastserver database, but will
be in the New Year.  The GSS sequences are currently being BLASTX'ed
against Swissprot/TrEMBL, and the top hits from the first batch of ~3600
GSS seqs can be found at: 

http://www.sanger.ac.uk/Projects/T_brucei/0001Blasthits.html

Additional BLASTX hit data will be posted on the web site as the
searches are completed.


***  L. major  ***
L. major home page: http://www.sanger.ac.uk/Projects/L_major/
Leishmania Genome Network home page:
http://www.ebi.ac.uk/parasites/leish.html

Several cosmid and PAC sequences have been returned by members of the
EULEISH consortium, and submitted to EMBL.  They are:

Final sequence, reannotation
LMFP1408      109,041 bp AL358652
LMFL2185       35,424 bp AL358712
LMFL5856       23,600 bp AL357592
LMFL6754       15,121 bp AL358632
LMFL5852       31,392 bp AL499614

Final, new sequence, annotated
LMFL4766       26,122 bp AL499615

Pre-final, new sequences, not annotated yet
LMFL9356       23,973 bp AL499612
LMFL4812       33,109 bp AL499613
LMFL490        29,151 bp accession not received yet
LMFL654        19,179 bp accession not received yet
LMFP696       176,944 bp accession not received yet

In addition, the PSU has been doing a large number of whole chromosome
shotguns (WCS) of PFG-purified Leishmania chromosomes, plus "skims" of
suitable mapped cosmids and/or PACs.  

A ***preliminary*** assembly of these has been done, and all sequence
contigs greater than 500 bp have been submitted to EMBL.  Shotgun is
*not* complete for the majority; these are still work in progress
(HTGS_PHASE1).

PLEASE NOTE THAT THESE SEQUENCES HAVE NOT BEEN EDITED AT ALL AND SHOULD
BE USED WITH CAUTION!!!

LMFLCHR25  AL499618  Chromosome 25, 667 contigs, 918,546 bp
LMFLCHR16  AL499619  Chromosomes 16&17, 800 contigs, 1,030,105 bp
LMFLCHR18  AL499620  Chromosomes 18&19&20&22, 721 contigs, 1,091,246 bp
LMFLCHR31  AL499621  Chromosome 31, 616 contigs, 1,027,083 bp
LMFLCHR32  AL499622  Chromosomes 32&33, 1737 contigs, 2,727,709 bp
LMFLCHR34  AL499623  Chromosome 34, 654 contigs, 697,361 bp
LMFLCHR36  AL499624  Chromosome 36, 1511 contigs, 1,650,993 bp
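
To get a feel for how fragmented these preliminary assemblies are, the
figures listed above can be tabulated with a short Python sketch.  This is
illustrative only; the names WCS_TABLE, LINE_RE and parse_wcs are
hypothetical, and the regular expression assumes the exact line layout
used in this message.

```python
import re

# The seven WCS entries exactly as listed in the announcement.
WCS_TABLE = """\
LMFLCHR25  AL499618  Chromosome 25, 667 contigs, 918,546 bp
LMFLCHR16  AL499619  Chromosomes 16&17, 800 contigs, 1,030,105 bp
LMFLCHR18  AL499620  Chromosomes 18&19&20&22, 721 contigs, 1,091,246 bp
LMFLCHR31  AL499621  Chromosome 31, 616 contigs, 1,027,083 bp
LMFLCHR32  AL499622  Chromosomes 32&33, 1737 contigs, 2,727,709 bp
LMFLCHR34  AL499623  Chromosome 34, 654 contigs, 697,361 bp
LMFLCHR36  AL499624  Chromosome 36, 1511 contigs, 1,650,993 bp
"""

# entry name, accession, chromosome label, contig count, total length
LINE_RE = re.compile(
    r"^(\S+)\s+(\S+)\s+(Chromosomes? [\d&]+), (\d+) contigs, ([\d,]+) bp$"
)


def parse_wcs(text):
    """Return one dict per table line that matches the expected layout."""
    rows = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            entry, accession, chrom, contigs, bp = match.groups()
            rows.append({
                "entry": entry,
                "accession": accession,
                "chromosomes": chrom,
                "contigs": int(contigs),
                "bp": int(bp.replace(",", "")),  # drop thousands separators
            })
    return rows


rows = parse_wcs(WCS_TABLE)
total_contigs = sum(r["contigs"] for r in rows)
total_bp = sum(r["bp"] for r in rows)

for r in rows:
    mean_len = r["bp"] / r["contigs"]
    print(f"{r['entry']}  {r['accession']}  mean contig length {mean_len:.0f} bp")
print(f"Total: {total_contigs} contigs, {total_bp:,} bp")
```

The mean contig length is only a rough indicator, since the shotguns are
incomplete and the contigs unedited.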

It should also be noted that some cross-contamination between
PFG-isolated chromosomes does occur, although we have endeavoured to
maximise chromosome purity prior to library construction.  

Data from chromosomes 15, 26 and 28 have been available for some time,
and can be found on their respective web pages:

http://www.sanger.ac.uk/Projects/L_major/chromosome15.shtml
http://www.sanger.ac.uk/Projects/L_major/chromosome26.shtml
http://www.sanger.ac.uk/Projects/L_major/chromosome28.shtml

On my return in the New Year, the new WCS sequences will be given a
low-level annotation and resubmitted.

Wishing you all a Happy Christmas and New Year, I'm off to snowy
Scotland now!

Cheers!

al


==================================
Al Ivens
Pathogen Sequencing Unit
Sanger Centre
Wellcome Trust Genome Campus
Hinxton
Cambs. CB10 1SA
UK

Tel +44-1223- 49 48 51
Fax +44-1223- 49 49 19
Email: alicat at sanger.ac.uk

