Cyanorak help menu

Cyanorak v2.1 is an information system dedicated to the curation, comparison and visualization of genomes belonging to a major cyanobacterial phylum (Cluster 5 sensu Herdman et al. (2001). The current version encompasses 41 genomes and 2 MAGs of the strictly marine genus Prochlorococcus and 54 marine, brackish or freshwater representatives of the polyphyletic Synechococcus/Cyanobium group. The core of Cyanorak is a database of manually curated ‘Clusters of likely orthologous genes’ (CLOGs) that can be accessed mainly by search tools.

The Top banner on a dark blue background present on all pages gives access to the different menus of the database, including:

Genomic Search results are displayed in distinct tabs, as detailed below.

Genomic Search results are organized into three distinct tabs: ‘matching clusters’, ‘matching CDS’ and ‘matching RNAs’. Each list gives the essential information about every match and provides links to access detailed descriptions of these entries and can be exported in various formats (CSV, XLS or pdf).

On top left of the CLOG page is a link to the ‘genomic context’, which displays the four genes upstream and downstream of the selected gene in all members of the CLOG. Two possible representations of the context are accessible through a toggle button: genes are shown either all at the same size or in relative size. To ease comparisons, the central gene is always represented in forward direction whatever its original direction in the genome and the context is arranged accordingly. Each CLOG has a (random) color and background (plain or stripped) and the genome context can be regenerated around any gene of the current context by clicking on the ORF_ID of the corresponding gene, while clicking on a CLOG number opens the corresponding CLOG page. All nucleotides and/or amino acid sequences of a given CLOG can be exported in Fasta format using links at the very bottom of the page.

The middle of the CLOG page consists in a phyletic pattern, i.e. the distribution of the genes in the different genomes, classified by taxonomy (genus, clade and, for Synechococcus SC 5.1 only, subclades), and indicating the pigment type of the corresponding strain.

The bottom of the CLOG page lists individual sequences comprised in the CLOG. Of note, for the sake of homogeneity all sequences, even when initially retrieved from  Genbank, have been attributed a standardized Cyanorak ORF_ID consisting in CK_genus name (Pro/Syn or Cya)_strain name_gene number in five digits, the latter digits corresponding to the initial locus tag. For instance, The Prochlorococcus MED4 gene PMM1063 has been given the ORF_ID CK_Pro_MED4_0163 while the Synechococcus sp. CB0205 locus tag SCB02_010100012987 has been given the Cyanorak ORF_ID CK_Syn_CB0205_12987. The annotation provided in this part is the initial gene annotation for genomes coming from Genbank, while the annotation found on the corresponding gene in in genome exports and in the JBrowse viewer will always be the annotation made at the CLOG level (top of page), which is often different.  Many genes, even in genomes initially coming from Genbank, have had their starts corrected when they were either too long or too short. Genomes have also been modified by adding a number of missing ORFs, while many wrong ORF predictions have been suppressed.

JBrowse viewer

The user can explore any ‘manually curated’ genome of the Cyanorak database using a JBrowse viewer. For all genomes of Cyanorak, genes have been attributed the standardized Cyanorak ORF_ID (see above) and have been assigned the detailed functional annotations found in individual CLOG page of the database.

The JBrowse viewer can be accessed by clicking on the ‘J’ logo next to each strain name either on the Organisms or the JBrowse page. The JBrowse environment allows the user to zoom in the genome using the google-plus button to see the local context and detailed annotation of any gene. Genes of interest can be searched for in the genome by entering text (e.g. ‘pheT’ or any Cyanorak ORF_ID) in the box next to chromosome. Arrows allows to navigate in the immediate environment of the gene. Right clicking on a gene gives access to its detailed functional annotation, as derived from the CLOG page, with hyperlinks to Cyanorak or external public databases.

The user can also view other genome information for a number of strains using the ‘select tracks’ menu located top left of JBrowse pages. Tracks of interest must be selected by clicking on boxes then can be visualized below the genome sequence by clicking on “back to browser”.

Available tracks include:

If users have generated transcriptomic data corresponding to one of the genomes included in the Cyanorak database and wish to have Log2(FC) values publicly visualized using the Cyanorak Jbrowse viewer, they should provide us with:

Exports

Various exports are available from different pages of the Cyanorak v2.1 information system, including strain and genome characteristics from the Organisms page, annotated complete genomes from the individual strain pages, protein and nucleotide sequences in fasta format from individual gene pages and multifasta protein and nucleotide sequences files from individual cluster pages.

How to Cite:

Garczarek L., Guyet U., Doré H., Farrant G.K., Hoebeke M., Brillet-Guéguen L., Bisch A., Ferrieux M., Siltanen J., Corre E., Le Corguillé G., Ratin M., Pitt F.D., Ostrowski M., Conan M., Siegel A., Labadie K., Aury J.-M., Wincker P., Scanlan D.J. & F. Partensky (2021). Cyanorak v2.1: a scalable information system dedicated to the visualization and expert curation of marine and brackish picocyanobacteria genomes. Nucleic Acids Research Database issue, in press.

Cyanorak Roles

-

Unclassified

 

Role category not yet assigned

-

-

0

Non-coding gene (RNA)

0.1

tRNA

-

-

0.2

rRNA

-

-

0.3

ncRNA

-

-

0.4

Other

-

-

A

Amino acid biosynthesis

A.1

Aromatic amino acids family (Phe, Trp, Tyr)

-

-

A.2

Aspartate family (Asp, Asn, Ile, Lys, Met, Thr)

-

-

A.3

Glutamate family (Arg, Gln, Glu, Pro)

-

-

A.4

Histidine family

-

-

A.5

Pyruvate family (Ala, Val, Leu)

-

-

A.6

Serine family (Ser, Gly, Cys)

-

-

A.7

Other

-

-

B

Biosynthesis of cofactors, prosthetic groups, and carriers

B.1

Lipoic acid

-

-

B.2

Menaquinone and ubiquinone

-

-

B.3

Molybdopterin

-

-

B.4

Pantothenate and coenzyme A

-

-

B.5

Pigments

B.5.1

Carotenoids

B.5.2

Chlorophylls and porphyrins

B.5.3

Hemes and phycobilins

B.6

Pyridine nucleotides (NAD, NADP)

-

-

B.7

Riboflavin, FMN and FAD

-

-

B.8

Siderophores

-

-

B.9

Thioredoxin, glutaredoxin, and glutathione

-

-

B.10

Vitamins

B.10.1

Biotin (b7)

B.10.2

Cobalamin (b12)

B.10.3

Folic acid (b9)

B.10.4

Pyridoxine (b6)

B.10.5

Thiamine (b1)

B.11

Other

-

-

C

Cell envelope

C.1

Membranes, lipoproteins, and porins

-

-

C.2

Murein sacculus and peptidoglycan

-

-

C.3

Surface polysaccharides and lipopolysaccharides

-

-

C.4

Surface structures

-

-

C.5

Other

-

-

D

Cellular processes

D.1

Adaptation/acclimation to atypical conditions and detoxification

D.1.1

Iron

D.1.2

Light

D.1.3

Nitrogen

D.1.4

Oxidative stress

D.1.5

Phosphorus

D.1.6

Temperature

D.1.7

Trace metals

D.1.8

Other

D.2

Cell division

-

-

D.3

Cell growth and death

-

-

D.4

Chaperones

-

-

D.5

Chemotaxis and motility

-

-

D.6

Circadian clock

-

-

D.7

Protein and peptide secretion

-

-

D.8

Toxin production and resistance

-

-

D.9

Transformation

-

-

D.10

Other

-

-

E

Central intermediary metabolism

E.1

Amino sugars

-

-

E.2

One-carbon metabolism

-

-

E.3

Phosphorus metabolism

-

-

E.4

Nitrogen metabolism

-

-

E.5

Polyamine biosynthesis

-

-

E.6

Polysaccharides and glycoproteins biosynthesis

-

-

E.7

Sulfur metabolism

-

-

E.8

Other

-

-

F

DNA metabolism

F.1

 

DNA replication, recombination, and repair

 

F.1.1

Base excision repair

F.1.2

Nucleotide excision repair

F.1.3

Mismatch repair

F.1.4

Homologous recombination

F.1.5

Non-homologous end-joining

F.1.6

Reversal of damage

F.2

Restriction/modification

-

-

F.3

Other

-

-

G

Energy metabolism

G.1

Amino acids and amines (catabolism)

-

-

G.2

Electron transport

-

-

G.3

Glycolate pathway

-

-

G.4

Glycolysis/gluconeogenesis

-

-

G.5

Pentose phosphate pathway

-

-

G.6

Pyruvate and acetyl-CoA metabolism

-

-

G.7

Pyruvate dehydrogenase

-

-

G.8

Starch and sucrose metabolism

-

-

G.9

TCA cycle

-

-

G.10

Other

-

-

H

Fatty acid, phospholipid and sterol metabolism

H.1

Ether lipid metabolism

-

-

H.2

Fatty acid metabolism

-

-

H.3

Glycerolipid metabolism

-

-

H.4

Glycerophospholipid metabolism

-

-

H.5

Ketone bodies metabolism

-

-

H.6

Linoleic acid metabolism

-

-

H.7

Sphingolipid and glycosphingolipid metabolism

-

-

I

Mobile and extrachromosomal element functions

I.1

Plasmid functions

-

-

I.2

Prophage functions

-

-

I.3

Transposon functions

-

-

I.4

Other

-

-

J

Photosynthesis and respiration

J.1

ATP synthase

-

-

J.2

CO2 fixation

-

-

J.3

Cytochrome b6/f complex

-

-

J.4

Light-harvesting-Pcb

-

-

J.5

Light-harvesting-Phycobilisomes

J.5.1

Allophycocyanin

J.5.2

Phycocyanin

J.5.3

Phycoerythrin

J.5.4

Phycobilin lyase

J.5.5

Other

J.6

NADH dehydrogenase

-

-

J.7

Photosystem I

-

-

J.8

Photosystem II

-

-

J.9

Respiratory terminal oxidases

-

-

J.10

Soluble electron carriers

-

-

J.11

Other

-

-

K

Protein synthesis

K.1

tRNA aminoacylation

-

-

K.2

Ribosomal proteins: synthesis and modification

-

-

K.3

tRNA and rRNA base modification

-

-

K.4

Translation factors

-

-

K.5

Other

-

-

L

Protein fate

L.1

Protein and peptide secretion and trafficking

-

-

L.2

Protein modification and repair

-

-

L.3

Protein folding and stabilization

-

-

L.4

Degradation of proteins, peptides, and glycopeptides

-

-

L.5

Other

-

-

M

Purines, pyrimidines, nucleosides, and nucleotides

M.1

2'-Deoxyribonucleotide metabolism

-

-

M.2

Nucleotide and nucleoside interconversions

-

-

M.3

Purine ribonucleotide biosynthesis

-

-

M.4

Pyrimidine ribonucleotide biosynthesis

-

-

M.5

Salvage of nucleosides and nucleotides

-

-

M.6

Sugar-nucleotide biosynthesis and conversions

-

-

M.7

Other

-

-

N

Regulatory functions

N.1

DNA interactions

-

-

N.2

RNA interactions

-

-

N.3

Protein interactions

-

-

N.4

Small molecule interactions

-

-

N.5

Other

-

-

O

Signal transduction

O.1

Two-component systems

O.1.1

Histidine kinase (HK)

O.1.2

response regulators  (RR)

O.2

Phosphotransferase systems (PTS)

-

-

O.3

Other

-

-

P

 

Transcription

 

P.1

Degradation of RNA

-

-

P.2

DNA-dependent RNA polymerase

-

-

P.3

Transcription factors

-

-

P.4

RNA processing

-

-

P.5

Other

-

-

Q

Transport and binding proteins

Q.1

Amino acids, peptides and amines

-

-

Q.2

Anions

-

-

Q.3

Carbohydrates, organic alcohols, and acids

-

-

Q.4

Cations and iron carrying compounds

-

-

Q.5

Nucleosides, purines and pyrimidines

-

-

Q.6

Porins

-

-

Q.7

Sugars

-

-

Q.8

Other

-

-

Q.9

Unknown substrate

-

-

R

Other

R.1

Conserved hypothetical domains

-

-

R.2

Conserved hypothetical proteins

-

-

R.3

Enzymes of unknown specificity

-

-

R.4

Hypothetical proteins

-

-

R.5

Other