Toward a Unified Gene Page GMOD Meeting, April 2004, Don Gilbert, gilbertd@indiana.edu Gene Pages * Common parts of MOD gene pages * What could/should be unified? * Who will benefit? Costs? * Web Reports AND XML ? * You: discuss, design common gene page Common gene attributes * Names, symbols/IDs, synonyms * Map locations * Sequences * Reagents * Gene ontology * Similar Genes * Database cross-refs, External links * Alleles, Transcripts * Proteins, Structure and Domains * Expression and Mutant Phenotypes * Gene Interactions * Literature references * Summary Text * .. others? Common to gene pages? * Labels - are these same things? -- Gene / locus / orf -- Homolog / ortholog / relationship / similarity -- Citation / publication / reference * Organization of document -- Section headers -- Important at top, common ordering? Common to gene pages? * Structure and size of default document -- Tabular, text, document-like, ... -- One screen or long report * Graphics (maps, icons, ...) * Further Detail options * Layout and Design (colors, formatting, fonts ..) What is customizable? * MOD customizations -- Look and Feel -- Details & Extensions * Customer choices -- Best for organism community (org. standard) -- Best for general reader (general standard) -- Best for beginners or experts (simple,complex) Example Gene Pages Common Gene XML? * Computable text of gene page ? -- "what you see (web page) is also what your computer can read" -- simple and human-readable, or complex and detailed * XML variants, tabular, other? -- Ace2XML, NCBI XML, others -- Samples (Web -> XML) Common tools? * Common gene page schema? * Labels CV/Ontology, XML DTD/template * Software, web templates ? * html, xml/xslt, perl, java * Adaptable to various databases? Tasks * Now: Design your common gene page * Cut/paste sample XML pages into one * Comment on what should be unified * Comment on best parts of MOD pages * After: Talk MODs into generating these http://eugenes.org/all/gene-report-examples/ HTML and XML Fly Gene Page as XML FlyBase: Cam Gene Synopsis Cam CG8472; CG8472; .. Calmodulin FBgn0000253 28 Feb 04 2R ... EF-hand family Fly GRID .. ... Cam+P148 148 .. calcium ion binding calmodulin binding .. .. D. melanogaster gene Calmodulin.. Yeast Gene Page as XML SGD: CMD1/YBR109C Gene Report CMD1 YBR109C ORF, Verified master regulator of calcium mediated signalling calmodulin Systematic deletioninviable II:458318-457875 ... 53 MIPS ... calcium ion binding ... ... S0000313 ORF map SCOP superfamily (MRC/Stanford) YeastRC Two-Hybrid Mouse Gene Page as XML MGI 2.98 - Marker Detail Gene Detail Calm1 calmodulin 1 MGI:88251 Nomenclature History Chromosome 12 .. human; rat Entrez:NM_009790 Calcium-binding EF-hand Mouse Locus Catalog calcium ion binding ... ... GXD literature index (2) NIA Mouse Gene Index J:9357 Kluxen FW et al., "Opposite regulation of the mRNAs for parvalbumin and p19/6.8 in myotonic mouse muscle." Eur J Biochem 1988 Sep 1;176(1):153-8 Worm Gene Page as XML WormBase: cmd-1 Gene Summary cmd-1 encodes a putative homolog of calmodulin 1 that affects growth rate and fertility. Caenorhabditis elegans cmd-1 (CGC approved) AJ132193 T21H3.3 confirmed by cDNA(s) Putative C. briggsae ortholog CBG01097 calcium ion binding IEA ... ... V:1156138..1157022 C. briggsae BP:CBP00318 gene CBG01097 5e-76 99.3