Annotations

OpenGenomeBrowser is designed to be flexible when it comes to annotations. There are default annotation types, but they can be changed and new ones can be added.

Default annotation types

By default, OpenGenomeBrowser supports these types of annotations:

Annotation type	Description	Source
`GC`	Gene code	GenBank file
`GP`	Gene product	GenBank file
`EP`	Eggnog product	Eggnog file
`ED`	Eggnog description	Eggnog file
`EO`	Eggnog ortholog	Eggnog file
`EC`	Enzyme commission	GenBank file and Eggnog file
`KG`	KEGG gene	Eggnog file
`KR`	KEGG reaction	Eggnog file
`GO`	Gene ontology	Eggnog file
`SL`	Subcellular location	-
`OL`	Ortholog	OrthoFinder

These are defined in the file annotations.json, located next to the organisms folder.

├── organisms
│    └── ...
└── annotations.json

Custom annotation types

New annotation types can simply be added by adding a new entry to this annotations.json file.

For example, this is what the definition of the EC annotation type looks like:

{
  "EC": {
    "name": "EC Number",
    "color": "rgb(222,203,228)",
    "regex": "^EC:[0-9\\.-]{1,12}$",
    "hyperlinks": [
      {
        "name": "SIB Rhea DB",
        "url": "https://www.rhea-db.org/rhea?query=${annotation}"
      },
      {
        "name": "BRENDA",
        "url": "https://www.brenda-enzymes.org/enzyme.php?ecno=${annotation.substring(3)}"
      }
    ]
  },
  "GO": "etc..."
}

Custom annotation files

Create files that map genes to annotations in this way: A valid custom_annotations file must be in this format: locustag\tannotation1, annotation2, annotation3

I.e., this would be a valid file:

EXAMPLE1234-2-1.1_000001	K000001
EXAMPLE1234-2-1.1_000004
EXAMPLE1234-2-1.1_000008	K000001, K000002

Add them to genome.json as described here.

Annotation descriptions (optional)

OpenGenomeBrowser allows its users to extend annotation identifiers such as e.g. K01695, EC:4.2.1.20 and GO:0005829 with a short description to in order to facilitate interpretation. To use this feature, simply provide a file for each annotation type which maps the annotation identifiers to their description. These files must be located in the folder annotation-descriptions and named {annotation_type}.tsv}.

├── organisms
│    └── ...
├── annotations.json
└── annotation-descriptions
     ├── EC.tsv
     └── ...

This is the expected format: annotation\tdescription

EC:4.2.1.17	beta-hydroxyacyl-CoA dehydrase
EC:4.2.1.18	methylglutaconyl-CoA hydratase
EC:4.2.1.19	imidazoleglycerol-phosphate dehydratase
EC:4.2.1.20	L-tryptophan synthetase

Import annotation descriptions

This step is only required if these files were added after the genomes were imported.

[Open a terminal in the container]https://github.com/opengenomebrowser/opengenomebrowser-docker-template#open-a-terminal-in-the-container
run python db_setup/manage_ogb.py import-annotation-descriptions