Database
a structured collection of data held in computer storage
• especially one that incorporates software to make it accessible in a variety of ways
• any large collection of information
Database management
the organization and manipulation of data in a database
Database management system (DBMS)
a software package that provides all the functions required for database management
Database system
a database together with a database management system
What is a database?
a collection of data…
• structured
• searchable (index) --> table of contents
• updated periodically (release) --> new edition
• cross-referenced (hyperlinks) --> links with other databases
A database includes
tools (software) necessary for
• access
• updating
• information insertion
• information deletion, etc.
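The four operations listed above (access, updating, insertion, deletion) can be sketched with Python's built-in `sqlite3` module. This is a minimal illustration only: the `sequence` table, its columns, and the accession values are hypothetical and do not come from any real biological database.

```python
import sqlite3

# In-memory database; table name, columns and values are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sequence ("
    "accession TEXT PRIMARY KEY, organism TEXT, length INTEGER)"
)

# Insertion: add two records.
conn.executemany(
    "INSERT INTO sequence VALUES (?, ?, ?)",
    [("ACC0001", "Homo sapiens", 1200), ("ACC0002", "Mus musculus", 950)],
)

# Access: look up records by organism.
rows = conn.execute(
    "SELECT accession, length FROM sequence WHERE organism = ?",
    ("Homo sapiens",),
).fetchall()

# Updating: correct the length of one record.
conn.execute(
    "UPDATE sequence SET length = ? WHERE accession = ?", (1250, "ACC0001")
)

# Deletion: remove one record.
conn.execute("DELETE FROM sequence WHERE accession = ?", ("ACC0002",))

remaining = conn.execute(
    "SELECT accession, organism, length FROM sequence"
).fetchall()
print(rows)
print(remaining)
conn.close()
```

Here SQLite plays the role of the database management system: the Python code only states what to insert, read, change, or remove, and the DBMS handles storage.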
Database storage management
• relational databases
Flat file
Relational database
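The notes name the two storage models without defining them, so the following is only a rough sketch of the usual distinction, under stated assumptions: a flat file keeps every record as one line of text in a single file, while a relational database splits the data into tables linked by keys. The field names, accessions, and table layout are hypothetical.

```python
import csv
import io
import sqlite3

# Flat file: one record per line, every field repeated in each record.
# (Simulated with an in-memory string; the contents are hypothetical.)
flat_file = io.StringIO(
    "accession\torganism\tlength\n"
    "ACC0001\tHomo sapiens\t1200\n"
    "ACC0002\tHomo sapiens\t950\n"
)
records = list(csv.DictReader(flat_file, delimiter="\t"))

# Searching a flat file means scanning every record.
human = [r["accession"] for r in records if r["organism"] == "Homo sapiens"]

# Relational: the organism is stored once and referenced by key.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE organism (id INTEGER PRIMARY KEY, name TEXT UNIQUE);
    CREATE TABLE sequence (
        accession   TEXT PRIMARY KEY,
        organism_id INTEGER REFERENCES organism(id),
        length      INTEGER
    );
    """
)
conn.execute("INSERT INTO organism (id, name) VALUES (1, 'Homo sapiens')")
conn.executemany(
    "INSERT INTO sequence VALUES (?, ?, ?)",
    [(r["accession"], 1, int(r["length"])) for r in records],
)

# A join reassembles the full record from the two tables.
joined = conn.execute(
    "SELECT s.accession, o.name FROM sequence s "
    "JOIN organism o ON o.id = s.organism_id ORDER BY s.accession"
).fetchall()
print(human)
print(joined)
conn.close()
```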
Why biological databases?
• exponential growth in biological data
• data are no longer published in a conventional manner, but directly submitted to databases
-- genomic sequences
-- 3D structures
-- 2D gel analysis
-- MS analysis
-- microarrays
• essential tools for biological research
– the only way to publish massive amounts of data without using all the paper in the world
The first databases to emerge concentrated on
collecting and annotating nucleotide and protein sequences generated by the early sequencing techniques
Number of different biological databases
more than 1000
Size of databases - variable
< 100 Kb to > 20 Gb
• DNA: >20 Gb
• protein: 1 Gb
• 3D structure: 5 Gb
Update frequency
from daily to annually, seldom, or effectively never
• usually accessible through the web
Some databases in the field of molecular biology
Categories of databases for life sciences
NCBI
GenBank is maintained at the National Center for Biotechnology Information
• Maryland, USA
EMBL
European Molecular Biology Laboratory
• at the European Bioinformatics Institute
• Cambridge, UK
DDBJ
DNA Data Bank of Japan
• at National Institute of Genetics
• Mishima, Japan
Objectives of these databases
EMBL, GenBank, DDBJ
Literature databases
Bookshelf
a collection of searchable biomedical books linked to PubMed
• literature database
PubMed
allows searching by author names, journal titles, and a new Preview/Index option
• provides access to over 12 million MEDLINE citations dating back to the mid-1960s
• includes History and Clipboard options which may enhance your search session
• literature database
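Searching PubMed by author name and journal title can also be done programmatically through NCBI's E-utilities. The sketch below only builds an `esearch` request URL and does not contact the server. The helper `pubmed_search_url` and the example author and journal are hypothetical; `[Author]` and `[Journal]` are PubMed search field tags.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Base URL of the E-utilities esearch endpoint.
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def pubmed_search_url(author=None, journal=None, retmax=20):
    """Build (but do not send) an esearch URL. Hypothetical helper."""
    terms = []
    if author:
        terms.append(f"{author}[Author]")
    if journal:
        terms.append(f"{journal}[Journal]")
    if not terms:
        raise ValueError("give at least an author or a journal")
    params = {"db": "pubmed", "term": " AND ".join(terms), "retmax": retmax}
    return f"{ESEARCH}?{urlencode(params)}"


# Placeholder author and journal, for illustration only.
url = pubmed_search_url(author="Smith J", journal="Nature")
print(url)

# Decode the query string again to check what was encoded.
print(parse_qs(urlsplit(url).query)["term"][0])
```

Fetching the URL (for example with `urllib.request.urlopen`) would return the matching PubMed identifiers. NCBI's usage guidelines should be checked before sending automated requests.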
OMIM
Online Mendelian Inheritance in Man
• database of human genes and genetic disorders (see also OMIA, Online Mendelian Inheritance in Animals)
• literature database