0% found this document useful (0 votes)

16 views14 pages

Sayan Sir Bio Informatics

Bioinformatics is an interdisciplinary field that merges biology, computer science, and mathematics to analyze biological data, focusing on areas such as genomics, proteomics, and drug discovery. The Central Dogma of Molecular Biology outlines the flow of genetic information from DNA to RNA to proteins, emphasizing the importance of molecular biology as an information science. Key applications of bioinformatics include disease diagnosis, personalized medicine, and agricultural improvements, while challenges include data management, computational complexity, and ethical concerns.

Uploaded by

sumit saha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views14 pages

Sayan Sir Bio Informatics

Uploaded by

sumit saha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Q1) What is Bioinformatics ?

Bioinformatics is a branch of science that coanalyzes and interprets biological data by

combiningiology, Computer Science, and Mathematics .

Bioinformatics is “the mathematical, statistical and computing methods that aim to solve
biological problems using DNA and amino acid sequences and related information.”

Q2) What is the Scope of Bioinformatics?

Bioinformatics can significantly influence in solving the following types of problems:

1. Prediction of 3-D structure based on linear genomic information, i.e., the study of structural
genomics.

2. Gene expression analysis, prediction of gene function, and establishment of gene libraries (functional
genomics).

3. The ability to use genome sequences to identify proteins and their functions, protein interactions,
modifications and functions, i.e., the field of proteomics.

4. Elucidating the function of a molecule based on its structure.

5. Simulating metabolism from the biochemical functions of an organism.

6. Molecular modeling and molecular dynamics are the methods to predict structurefrom function.
Several methods of machine learning are used to predict function from sequence structure.

7. Medical science would need to know the pathways to know which genomic changes could give rise to
each known inherited disease, i.e., identification of the gene causing disease, and also, genetic
therapies that can reverse disease phenotypes.

8. Data obtained from functional genomics and proteomics could be used in drug designing and
discovery.

Q)5) What is the Central Dogma of Molecular Biology ??Explain it with the suitable diagram
Replication
(DNA → DNA)
↓
Transcription
(DNA → RNA)
↓
Translation
(RNA → Protein)

The Central Dogma of Molecular Biology describes the flow of genetic information within a biological
system. It was first proposed by Francis Crick in 1958 and states that genetic information is transferred
from DNA to RNA to Protein in a unidirectional manner.

Steps of Central Dogma:

1. Replication – DNA makes a copy of itself (DNA → DNA).

2. Transcription – Information from DNA is transcribed into messenger RNA (DNA → RNA).
3. Translation – The information in RNA is translated to form proteins (RNA → Protein).

Q3) Why is Bioinformatics a Multidisciplinary Field?

Bioinformatics is a multidisciplinary field that combines biology, computer science, mathematics,
statistics, and chemistry to analyze and interpret biological data. It integrates genetics, molecular
biology, and biochemistry to study DNA, RNA, and proteins, while computer science and programming
provide algorithms and machine learning models to process large-scale biological datasets. Mathematics
and statistics help in data interpretation and predictive modeling, and chemistry and biophysics
contribute to drug discovery and molecular simulations. The field also plays a crucial role in healthcare
and biomedical sciences, aiding in disease diagnosis, personalized medicine, and vaccine development.
This interdisciplinary nature makes bioinformatics essential for solving complex biological problems,
from genome analysis to drug design.
Q4) Highlight different application and challenges of Bioinformatics

Applications of Bioinformatics

1. Genomics & Proteomics 🧬

○ DNA sequencing, genome annotation, and comparative genomics.

💊
○ Example: Human Genome Project for understanding genetic diseases.
2. Drug Discovery & Personalized Medicine
○ Identifying drug targets and designing precision medicine based on genetic profiles.

🏥
○ Example: AI-driven drug discovery and cancer treatment based on genetic mutations.
3. Disease Diagnosis & Healthcare
○ Identifying genetic disorders, predicting disease risks, and designing early detection
methods.

🌿
○ Example: COVID-19 genome sequencing for vaccine development.
4. Agricultural & Environmental Science
○ Improving crop yield, disease resistance, and sustainability through genetic studies.

🕵️‍♂️
○ Example: Genetically modified (GM) crops with higher resistance to pests.
5. Forensic Science & Evolutionary Biology
○ DNA fingerprinting in criminal investigations and tracing evolutionary relationships.

🏗️
○ Example: Identifying criminals using DNA profiling or studying ancient species.
6. Systems Biology & Synthetic Biology
○ Modeling biological networks and designing synthetic organisms for industrial
applications.
○ Example: Engineering bacteria for biofuel production.

Challenges of Bioinformatics

1. Big Data Management 📊

💻
○ Handling and analyzing massive biological datasets from sequencing technologies.
2. Computational Complexity

✔️
○ High processing power required for genome sequencing and protein modeling.
3. Data Accuracy & Quality

🔬
○ Errors in sequencing data and computational models can lead to incorrect predictions.
4. Interdisciplinary Knowledge Gap

⚖️
○ Requires expertise in biology, programming, statistics, and machine learning.
5. Privacy & Ethical Concerns

💰
○ Genetic data privacy and ethical issues related to human genome research.
6. Cost of Analysis & Infrastructure
○ High costs of sequencing technologies and computational resources.

Q6) How can Molecular Biology be considered as Information Science?

Molecular biology is the understanding of biological processes at the molecular level.
Understanding is built through the use of physicochemical laws. Molecular biology is
an amalgamation of genetics and biochemistry. Biological processes involve biomolecules
(lipids, nucleic acids, carbohydrates, proteins, etc.) that form biological structures (organelles,
membranes, tissues, etc.). These biomolecules are present in the organism because of the
expression of information residing in the genetic material (DNA). Molecular biology can,
therefore, be understood as an information science.

The connection between molecular biology and information science lies in the way biological
systems process and transmit information. Here's a breakdown of how molecular biology can be
considered an information science:

● DNA as Information Storage:

○ DNA is essentially a biological database, storing genetic information in the form of a
sequence of nucleotides. This sequence acts as a code, carrying instructions for building
and maintaining an organism.
○ This aligns with the core concept of information science, which deals with the storage,
retrieval, and processing of information.
● Information Transfer:
○ The processes of transcription and translation, where DNA is copied into RNA and then
RNA is used to synthesize proteins, are fundamentally information transfer processes.
○ These processes involve decoding, interpreting, and executing information, similar to how
computers process data.
● Genetic Code as a Language:
○ The genetic code, which dictates how nucleotide sequences translate into amino acid
sequences, can be viewed as a biological language.
○ This language has its own syntax and semantics, allowing for the precise communication
of biological information.
● Bioinformatics:
○ The rise of bioinformatics, a field that combines molecular biology with computer
science, further solidifies this connection.
○ Bioinformatics utilizes computational tools and algorithms to analyze vast amounts of
biological data, such as genome sequences and protein structures. This highlights the
information-centric nature of modern molecular biology.
● Systems Biology:
○ Systems biology looks at the interactions of biological systems as a whole.
This involves the modeling of complex networks of information flow within cells and
organisms. This is very much related to how information systems are looked at.

Q)7) Define DNA and RNA. Explain different Components of the DNA and RNA.
DNA (Deoxyribonucleic Acid):

Definition:
DNA is the hereditary material in most living organisms. It carries the genetic instructions used in the
growth, development, functioning, and reproduction of all known living organisms and many viruses.
It's structured as a double helix, consisting of two strands that wind around each other.
Components:
Nucleotides: DNA is made up of repeating units called nucleotides. Each nucleotide consists of three
parts:
Deoxyribose: A five-carbon sugar.
Phosphate group: A molecule containing phosphorus.
Nitrogenous base: One of four molecules:
Adenine (A)
Thymine (T)
Guanine (G)
Cytosine (C)

The DNA strands are held together by hydrogen bonds between the nitrogenous bases: A pairs with T, and
G pairs with C.
RNA (Ribonucleic Acid):

Definition:
RNA is a nucleic acid that plays various roles in protein synthesis and gene regulation.
Unlike DNA, RNA is typically single-stranded.
Components:
Nucleotides: RNA is also composed of nucleotides, but with some differences:
Ribose: A five-carbon sugar (slightly different from deoxyribose).
Phosphate group: Similar to DNA.
Nitrogenous base:
Adenine (A)
Uracil (U) (replaces thymine)
Guanine (G)
Cytosine (C)

In RNA, A pairs with U, and G pairs with C.

Q)8) Differentiate of DNA and RNA.

Feature DNA (Deoxyribonucleic Acid) 🧬 RNA (Ribonucleic Acid) 🧾

Full Form Deoxyribonucleic Acid Ribonucleic Acid

Strands Double-stranded (forms a double Single-stranded

helix)

Sugar Deoxyribose (lacks one oxygen Ribose (has an extra -OH group)
atom)

Nitrogenous Bases Adenine (A), Thymine (T), Guanine Adenine (A), Uracil (U) (instead of T),
(G), Cytosine (C) Guanine (G), Cytosine (C)

Function Stores genetic information and Helps in protein synthesis (mRNA, tRNA,
transmits it to offspring rRNA)

Location Found in the nucleus (and Found in the nucleus and cytoplasm
mitochondria)
Stability Highly stable due to Less stable (single-stranded and prone to
double-stranded structure degradation)

Replication Self-replicating (DNA replication) Synthesized from DNA (Transcription)

Types One type (DNA) Three main types: mRNA (messenger),

tRNA (transfer), rRNA (ribosomal)

Role in Protein Stores instructions for making Translates genetic code into proteins
Synthesis proteins

Q)9) What are the various types of Protein Database. Which are the most important types of Protein
Database.

1. Primary Protein Databases 🏛️

These databases store experimentally determined protein sequences and annotations.

● UniProt (Universal Protein Resource) – A comprehensive resource combining Swiss-Prot,

TrEMBL, and PIR-PSD. It provides protein sequences, annotations, and functional information.
● Swiss-Prot – High-quality, manually curated protein sequences.
● TrEMBL – Automatically annotated sequences not yet reviewed.
● PIR-PSD – Previously maintained by the Protein Information Resource (PIR), now merged into
UniProt.

2. Composite Protein Databases 📂

These databases combine and filter data from multiple primary sources to avoid redundancy and improve
completeness.

● OWL – A non-redundant protein sequence database compiled from Swiss-Prot and other
sources. It eliminates exact and very similar duplicates.

3. Functional and Domain-Based Protein Databases 🧬

These databases help in protein function and family classification.

● PROSITE – Contains short sequence patterns and profiles to identify biologically significant
protein sites.
● Pfam – A database of protein families defined as domains. It uses multiple sequence alignments
to classify new sequences into known families.
4. Structural Protein Databases 🏗️
These databases store 3D structures of proteins, RNA, DNA, and macromolecular complexes.

● PDB (Protein Data Bank) – The primary repository for 3D protein structures, determined mainly
by X-ray crystallography and NMR.
● CATH – Classifies proteins into four hierarchical levels:
○ Class (C) – Based on secondary structure composition.
○ Architecture (A) – Describes overall shape without considering connectivity.
○ Topology (T) – Groups proteins with similar connectivity patterns.
○ Homologous superfamily (H) – Proteins with a common evolutionary ancestor.
● SCOP – A manual and automated classification of protein structures, describing their structural
and evolutionary relationships.

Most Important Protein Databases

1. UniProt – Most comprehensive sequence and annotation database.

2. PDB – Essential for 3D protein structures.
3. Pfam – Important for classifying protein families and domains.
4. CATH & SCOP – Crucial for structural and evolutionary classification of proteins.

Q)1) Write the main purpose of a biological database.

Storing and Managing Large Biological Data – Organizing genomic, proteomic, structural, and metabolic
data in an accessible format.
Providing Efficient Data Retrieval – Allowing researchers to quickly search and retrieve relevant
sequences, structures, and annotations.
Facilitating Sequence Comparison and Alignment – Enabling comparative analysis of DNA, RNA, and
protein sequences to understand genetic relationships.
Supporting Functional Annotation – Helping scientists predict the functions of genes and proteins by
analyzing sequence similarities.
Aiding in Evolutionary and Phylogenetic Studies – Providing insights into evolutionary relationships
between different species.
Enhancing Drug Discovery and Biomedical Research – Assisting in disease research, biomarker
identification, and drug target discovery.
Standardizing Biological Information – Ensuring consistency in data representation and nomenclature
across the scientific community.
Integrating Multidisciplinary Data – Connecting genomic, proteomic, and metabolic data to provide a
complete picture of biological systems.
Facilitating Machine Learning and AI Applications – Enabling computational models to predict gene
functions, protein interactions, and disease mechanisms.
Promoting Open Science and Collaboration – Allowing researchers worldwide to share and access
biological data for scientific advancements.
Q)2) Explain the different highlighting features of biological databases.

Key Features of Biological Databases 🧬💻

Biological databases are designed to store, manage, and analyze biological information efficiently. Their
main highlighting features include:

1. Large-Scale Data Storage 📊

● Stores vast amounts of biological data, including genomic sequences, protein structures, and
metabolic pathways.
● Examples: GenBank (DNA sequences), UniProt (protein sequences), PDB (3D protein
structures).

2. Data Organization and Classification 📂

● Data is structured into categories like nucleotide sequences, protein sequences, gene
expression profiles, and metabolic pathways.
● Classification based on organism, function, structural features, and evolutionary relationships.

3. Fast and Efficient Data Retrieval 🔍

● Provides query systems to search for specific genes, proteins, or pathways.
● Uses keywords, accession numbers, or sequence similarity searches (e.g., BLAST).

4. Integration with Other Databases 🔗

● Cross-linked with other databases to provide comprehensive biological insights.
● Example: Swiss-Prot is linked to PDB for structural details, KEGG links genes to metabolic
pathways.

5. Data Redundancy Handling ❌

● Composite databases like OWL remove duplicate sequences to create non-redundant datasets.
● Ensures accurate and efficient data processing.

6. Sequence Alignment and Analysis Tools 🧬

● Many databases provide tools for sequence alignment (e.g., Clustal Omega, BLAST), motif
discovery, and structural predictions.
● Helps in functional annotation and evolutionary studies.

7. Standardized and Curated Data ✅

● Some databases (like Swiss-Prot) have manually curated and reviewed data, ensuring accuracy.
● Others (like TrEMBL) are automatically generated but later validated.
8. Free and Open Access 🌍
● Many databases (GenBank, UniProt, PDB) are freely accessible, promoting open science and
collaboration.
● Some proprietary databases (HGMD for genetic mutations) require subscriptions.

9. Regular Updates and Version Control 🔄

● Frequently updated to include newly discovered genes, proteins, and structures.
● Provides version history to track changes in annotations.

10. Support for Computational and AI-Based Research 🤖

● Many databases support machine learning and AI applications for disease prediction, drug
discovery, and protein structure prediction.

Q)3) Classify different schemas of biological databases.

There are six fundamental considerations in designing and using biological databases
(Figure 2.5). These are:
1. Type of data, e.g. sequences, structures, pathways, etc.
2. Technical design of the database, e.g. flat-file, relational database, etc.
3. Data entry and curation facilities, e.g. curated database versus a non-curated database.
4. Primary or derived data, e.g. database may be a primary database or a secondary one
or a linked database of several databases.
5. Ownership of the database, e.g. academic or commercial ownership.
6. Availability, e.g. available freely or paid database.
As biological databases have grown and record keeping is one of the key management
principles, this is usually done through the use of identifiers and accession codes as discussed
below.
Q)4) Give the example of the following biological databases: sequence, genome, microarray, metabolic,
chemical, structural, disease, functional, pathway.

Database Type Example Database Description

Sequence GenBank (NCBI) A comprehensive database of publicly

Databases available DNA and RNA sequences.

Genome Ensembl (EBI) Provides genome annotations for various

Databases species, including humans, mice, and plants.

Microarray GEO (Gene Expression Omnibus, Stores high-throughput gene expression and
Databases NCBI) microarray data.

Metabolic KEGG (Kyoto Encyclopedia of Contains information on metabolic pathways,

Databases Genes and Genomes) genes, and diseases.

Chemical PubChem (NCBI) Provides data on chemical compounds,

Databases bioactivity, and structure.

Structural PDB (Protein Data Bank) The primary repository for 3D structures of
Databases proteins, DNA, and RNA.

Disease OMIM (Online Mendelian A database of human genes and genetic

Databases Inheritance in Man) disorders.

Functional GO (Gene Ontology) Provides standardized functional annotations

Databases for genes and proteins.

Pathway Reactome A curated database of biological pathways in

Databases humans and other organisms.

It's helpful to categorize biological databases by the type of data they hold. Here are examples of
databases for different categories:

● Sequence Databases:

○ GenBank (NCBI): A comprehensive database of nucleotide sequences.

○ UniProt: A database of protein sequences and functional information.
○ EMBL-Bank (EBI): Another major nucleotide sequence database.
○ DDBJ (DNA Data Bank of Japan)
● Genome Databases:

○ Ensembl: A genome browser that provides access to vertebrate and other eukaryotic
genomes.
○ NCBI Genome: Provides access to a wide range of genome sequences.
○ UCSC Genome Browser: Another popular genome browser.
● Microarray Databases:

○ Gene Expression Omnibus (GEO) (NCBI): A public repository of microarray and other gene
expression data.
○ ArrayExpress (EBI): A database of functional genomics experiments.
● Metabolic Databases:

○ KEGG (Kyoto Encyclopedia of Genes and Genomes): A database of metabolic pathways

and other biological systems.
○ MetaCyc: A database of experimentally elucidated metabolic pathways.
○ HMDB (Human Metabolome Database)
● Chemical Databases:

○ PubChem (NCBI): A database of chemical molecules.

○ ChEMBL (EBI): A database of bioactive molecules with drug-like properties.
● Structural Databases:

○ Protein Data Bank (PDB): A database of 3D structures of proteins and nucleic acids.
● Disease Databases:

○ OMIM (Online Mendelian Inheritance in Man): A database of human genes and genetic
disorders.
○ ClinVar (NCBI): A database of genetic variations and their relationships to human health.
● Functional Databases:

○ Gene Ontology (GO): A database of gene functions.

● Pathway Databases:

○ Reactome: A database of biological pathways and processes.

○ KEGG pathways.

Q)5) Explain different data retrieval tools for biological databases.

Different data retrieval tools for biological databases include:

1. Text Term Searching:

● Entrez: Provides integrated access to nucleotide and protein sequences, 3D structures, genomic
mapping, and literature.
● LinkOut: A registry service that links Entrez data to external resources.
● Cubby: Allows users to store, update searches, and customize LinkOut displays.
● Citation Matcher: Helps find the PubMed ID or MEDLINE UID of articles in PubMed.

2. Sequence Similarity Searching:

● BLAST Homepage: Access to BLAST (Basic Local Alignment Search Tool) for sequence
alignment.
● Blink: Displays precomputed BLAST results for each protein sequence in Entrez.
● Network-client BLAST: A client program (Blastcl3) that allows bulk BLAST searches.

3. Taxonomy Tools:

● Taxonomy Browser: Searches NCBI’s taxonomy database.

● Taxonomy BLAST: Groups BLAST hits by source organism.
● TaxTable: Summarizes BLAST taxonomy data in graphical form.
● ProtTable: Provides a summary of protein-coding regions in a genome.
● TaxPlot: Compares genome similarities in a three-way plot.

4. Sequence Submission Tools:

● Sequin: A tool for data submission that includes ORF Finder and an alignment viewer.
● BankIt: A web-based tool for submitting simple sequence data.

5. SRS (Sequence Retrieval System):

A web-based system for browsing and retrieving data from multiple biological databases. It accesses:

● EMBL: A nucleotide sequence database.

● Swiss-Prot: A well-annotated protein sequence database.
● Macromolecular Structure Database: Stores macromolecular structural data.
● ArrayExpress: Stores gene expression data.
● ENSEMBL: Provides genome sequences and annotations.

6. DBGET:

An integrated database retrieval system with three main commands:

● bget: Retrieves database entries by identifier.

● bfind: Searches for entries using keywords.
● blink: Retrieves related database entries.

Q)6) What are the different challenges faced to retrieve the data from the biological databases?

Challenges in Retrieving Data from Biological Databases

Retrieving data from biological databases involves several challenges due to the complexity and diversity
of biological information. Some of the key challenges include:

1. Data Redundancy and Duplication

● Multiple databases store the same sequence information, leading to redundancy.

● Different naming conventions and identifiers make it difficult to cross-reference data.

2. Data Volume and Scalability

● Biological databases are growing rapidly due to advancements in sequencing technologies.

● Retrieving, processing, and managing large-scale genomic and proteomic data require high
computational resources.

3. Data Format Variability

● Different databases use different data formats (FASTA, GenBank, PDB, etc.).
● Compatibility issues arise when integrating data from multiple sources.

4. Lack of Standardization

● Differences in annotation standards and classification systems across databases create

inconsistencies.
● Incomplete or ambiguous metadata can lead to misinterpretation of results.

5. Data Retrieval Efficiency

● Searching large biological databases can be slow and resource-intensive.

● Querying complex relationships (e.g., protein-protein interactions, metabolic pathways) requires
advanced algorithms.

6. Updating and Versioning Issues

● Frequent database updates may lead to changes in sequence identifiers and annotations.
● Older studies may cite outdated data, leading to inconsistencies in research findings.

7. Data Access and Licensing Restrictions

● Some databases have restricted access due to proprietary rights or subscription-based models.
● Open-access databases may have limitations on data usage or redistribution.

8. Integration of Heterogeneous Data Sources

● Combining data from different sources (genomic, proteomic, clinical, etc.) is challenging.
● Requires robust bioinformatics tools to ensure accurate integration.
9. Computational Resource Requirements

● Large-scale data retrieval and analysis demand high-performance computing (HPC)

infrastructure.
● Cloud-based solutions are being adopted, but cost and security concerns remain.

10. Biological Complexity and Data Interpretation

● Biological data is highly complex, and retrieving meaningful insights requires domain expertise.
● Errors in annotation or incomplete datasets can mislead research outcomes.

Molecular Clock, PPT
No ratings yet
Molecular Clock, PPT
15 pages
Bioinformatics Question Answers
No ratings yet
Bioinformatics Question Answers
25 pages
Assignment4 22BI13350
No ratings yet
Assignment4 22BI13350
6 pages
Bioinformatics Practical File
No ratings yet
Bioinformatics Practical File
12 pages
Bio Informatics Mrinmoyee Mam
No ratings yet
Bio Informatics Mrinmoyee Mam
15 pages
Unit 4
No ratings yet
Unit 4
13 pages
01 Biological Backgound
No ratings yet
01 Biological Backgound
74 pages
A Review of Fractional-Order Models For Plant Epidemiology: Progress in Fractional Differentiation and Applications
No ratings yet
A Review of Fractional-Order Models For Plant Epidemiology: Progress in Fractional Differentiation and Applications
33 pages
Bioinf Lecture1-2
No ratings yet
Bioinf Lecture1-2
44 pages
Bioinformatics 1
No ratings yet
Bioinformatics 1
52 pages
Motivation Letter
No ratings yet
Motivation Letter
2 pages
Intro To Bioinformatics
No ratings yet
Intro To Bioinformatics
50 pages
Bioinformatics Basics Applications in Biological Science and Medicine 2nd Edition Lukas K. Buehler (Editor) Latest PDF 2025
No ratings yet
Bioinformatics Basics Applications in Biological Science and Medicine 2nd Edition Lukas K. Buehler (Editor) Latest PDF 2025
105 pages
Aifs 2025 Flyer
No ratings yet
Aifs 2025 Flyer
1 page
Blast and Fasta Presentation
No ratings yet
Blast and Fasta Presentation
9 pages
Bio Info Tech
No ratings yet
Bio Info Tech
27 pages
Bioinfromatics Part - 2
No ratings yet
Bioinfromatics Part - 2
77 pages
BLOSUM Matrix Construction Guide
No ratings yet
BLOSUM Matrix Construction Guide
41 pages
Lec (1) - Introduction
No ratings yet
Lec (1) - Introduction
41 pages
Lecture Notes Biotechnology and Bioinformatics Mls 412 Bioinformatics Section
No ratings yet
Lecture Notes Biotechnology and Bioinformatics Mls 412 Bioinformatics Section
16 pages
Types of Phylogenetic Trees
No ratings yet
Types of Phylogenetic Trees
8 pages
3rd International Conference On Bioinformatics and Computational Studies (ICBCS 2025)
No ratings yet
3rd International Conference On Bioinformatics and Computational Studies (ICBCS 2025)
2 pages
Master MCQ S1 24 25
No ratings yet
Master MCQ S1 24 25
4 pages
Intro To BioInformatics Lec 1
No ratings yet
Intro To BioInformatics Lec 1
8 pages
Bioinformatics Lecture 1-Fall 2024
No ratings yet
Bioinformatics Lecture 1-Fall 2024
39 pages
Bioinformatics
100% (2)
Bioinformatics
104 pages
BIF401 Midterm Short Notes
No ratings yet
BIF401 Midterm Short Notes
45 pages
Computational Biology: by Safynaz Abdel-Fattah Sayed Computer Science Department
No ratings yet
Computational Biology: by Safynaz Abdel-Fattah Sayed Computer Science Department
36 pages
Module I
No ratings yet
Module I
65 pages
Bioinformatics (Final)
No ratings yet
Bioinformatics (Final)
41 pages
Unit-1 Bioinformatics
No ratings yet
Unit-1 Bioinformatics
17 pages
Le Sy Vin Et Al - 2007 - Towards Phylogenomics Reconstruction
No ratings yet
Le Sy Vin Et Al - 2007 - Towards Phylogenomics Reconstruction
8 pages
Bif401 Highlighted Subjective Handouts by BINT - E - HAWA
No ratings yet
Bif401 Highlighted Subjective Handouts by BINT - E - HAWA
222 pages
T Coffee - Overview
No ratings yet
T Coffee - Overview
102 pages
Estimating Phylogenetic Trees With Phangorn (Version 1.6-0) : Klaus P. Schliep April 5, 2012
No ratings yet
Estimating Phylogenetic Trees With Phangorn (Version 1.6-0) : Klaus P. Schliep April 5, 2012
12 pages
Bioinfo Final Practical
No ratings yet
Bioinfo Final Practical
66 pages
7 HomologyModelling 12oct2020
No ratings yet
7 HomologyModelling 12oct2020
8 pages
Bioinformatics for Science Students
No ratings yet
Bioinformatics for Science Students
78 pages
Pulwama Attack
No ratings yet
Pulwama Attack
1 page
Bioin
No ratings yet
Bioin
34 pages
scRNA-seq Analysis JY Chen 09-13-2019 Share
No ratings yet
scRNA-seq Analysis JY Chen 09-13-2019 Share
27 pages
CE6068 Lecture 2
No ratings yet
CE6068 Lecture 2
95 pages
List of Open-Source Bioinformatics Software
No ratings yet
List of Open-Source Bioinformatics Software
5 pages
Introduction To Xenobots
No ratings yet
Introduction To Xenobots
7 pages
#1 Pendahuluan
No ratings yet
#1 Pendahuluan
134 pages
Unit I
No ratings yet
Unit I
11 pages
BCH 516-1
No ratings yet
BCH 516-1
32 pages
Introduction To Bioinformatics: Tolga Can
No ratings yet
Introduction To Bioinformatics: Tolga Can
21 pages
Local and Global Sequence Alignment 5+5 Examples
No ratings yet
Local and Global Sequence Alignment 5+5 Examples
10 pages
Introduction to Bioinformatics
No ratings yet
Introduction to Bioinformatics
14 pages
"If You Can't Do Bioinformatics, You Can't Do Biology", J.D. Tisdall, 2003
No ratings yet
"If You Can't Do Bioinformatics, You Can't Do Biology", J.D. Tisdall, 2003
12 pages
Problem: Last Updated: Dec 5 2020, 20:28
No ratings yet
Problem: Last Updated: Dec 5 2020, 20:28
4 pages
2022 Fall BCHM 3P02 Project+1+Outline
No ratings yet
2022 Fall BCHM 3P02 Project+1+Outline
3 pages
1.1 General
No ratings yet
1.1 General
74 pages
Book 2014 Samal Bioinformaticsmanual
No ratings yet
Book 2014 Samal Bioinformaticsmanual
120 pages
BTH 403-BTG407 Lecture 1
No ratings yet
BTH 403-BTG407 Lecture 1
6 pages
Bioinformatics Sequence Alignment
No ratings yet
Bioinformatics Sequence Alignment
90 pages
Bioinformatics:: Guide To Bio-Computing and The Internet
No ratings yet
Bioinformatics:: Guide To Bio-Computing and The Internet
34 pages
BT403 QP
No ratings yet
BT403 QP
2 pages
PLAZA 3.0: An Access Point For Plant Comparative Genomics
No ratings yet
PLAZA 3.0: An Access Point For Plant Comparative Genomics
8 pages
To Calculate Length of Vowel Words
No ratings yet
To Calculate Length of Vowel Words
17 pages
Lecture 1-2 Intro
No ratings yet
Lecture 1-2 Intro
24 pages
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
100% (2)
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
54 pages
Banner SBG
No ratings yet
Banner SBG
1 page
Bioinformation: Phylogenetic Analysis of Chloroplast Matk Gene From Zingiberaceae For Plant Dna Barcoding
No ratings yet
Bioinformation: Phylogenetic Analysis of Chloroplast Matk Gene From Zingiberaceae For Plant Dna Barcoding
4 pages
Sequences Alignments (Similarity & Homology)
No ratings yet
Sequences Alignments (Similarity & Homology)
32 pages
Profile HMMs in Sequence Alignment
No ratings yet
Profile HMMs in Sequence Alignment
4 pages
Bioinformatics Made Easy
No ratings yet
Bioinformatics Made Easy
232 pages
Lecture 1: INTRODUCTION: A/Prof. Ly Le School of Biotechnology Email: Office: RM 705
100% (1)
Lecture 1: INTRODUCTION: A/Prof. Ly Le School of Biotechnology Email: Office: RM 705
43 pages
Introduction To Bioinformatics
No ratings yet
Introduction To Bioinformatics
76 pages
Introduction To Bioinformatics
No ratings yet
Introduction To Bioinformatics
61 pages
Protein Tertiaty Structure Prediction
No ratings yet
Protein Tertiaty Structure Prediction
12 pages
Bioinformatics for Beginners
No ratings yet
Bioinformatics for Beginners
232 pages
01introduction PDF
No ratings yet
01introduction PDF
22 pages
Hop Indonesia
No ratings yet
Hop Indonesia
80 pages
Introduction to Bioinformatics Course
No ratings yet
Introduction to Bioinformatics Course
34 pages
Bioinformatics Career Scope India
No ratings yet
Bioinformatics Career Scope India
2 pages
Concepts of Bioinformatics PDF
100% (2)
Concepts of Bioinformatics PDF
20 pages
Lecture 01
No ratings yet
Lecture 01
20 pages
Complete Notes On Bioinformatics
No ratings yet
Complete Notes On Bioinformatics
17 pages
Into To Bioinfo
No ratings yet
Into To Bioinfo
53 pages
Download
No ratings yet
Download
19 pages
Introduction To NCBI Resources
No ratings yet
Introduction To NCBI Resources
39 pages
Lecture1 20060306 Kang
No ratings yet
Lecture1 20060306 Kang
46 pages
Bioinformatics and Functional Genomics, Second Edition. by Jonathan Pevsner
No ratings yet
Bioinformatics and Functional Genomics, Second Edition. by Jonathan Pevsner
9 pages
Test For Upload
No ratings yet
Test For Upload
25 pages
8024 Bio Info
No ratings yet
8024 Bio Info
28 pages
Bioinformatics and Quantumcomputing: Bio Informatics
No ratings yet
Bioinformatics and Quantumcomputing: Bio Informatics
10 pages

Sayan Sir Bio Informatics

Uploaded by

Sayan Sir Bio Informatics

Uploaded by

Q1) What is Bioinformatics ?

Bioinformatics is a branch of science that coanalyzes and interprets biological data by

Q2) What is the Scope of Bioinformatics?

Bioinformatics can significantly influence in solving the following types of problems:

4. Elucidating the function of a molecule based on its structure.

5. Simulating metabolism from the biochemical functions of an organism.

Steps of Central Dogma:

1.​ Replication – DNA makes a copy of itself (DNA → DNA).

Q3) Why is Bioinformatics a Multidisciplinary Field?

1.​ Genomics & Proteomics 🧬

1.​ Big Data Management 📊

Q6) How can Molecular Biology be considered as Information Science?

●​ DNA as Information Storage:

In RNA, A pairs with U, and G pairs with C.

Q)8) Differentiate of DNA and RNA.

Feature DNA (Deoxyribonucleic Acid) 🧬 RNA (Ribonucleic Acid) 🧾

Strands Double-stranded (forms a double Single-stranded

Replication Self-replicating (DNA replication) Synthesized from DNA (Transcription)

Types One type (DNA) Three main types: mRNA (messenger),

1. Primary Protein Databases 🏛️

●​ UniProt (Universal Protein Resource) – A comprehensive resource combining Swiss-Prot,

2. Composite Protein Databases 📂

3. Functional and Domain-Based Protein Databases 🧬

Most Important Protein Databases

1.​ UniProt – Most comprehensive sequence and annotation database.

Q)1) Write the main purpose of a biological database.

Key Features of Biological Databases 🧬💻

1. Large-Scale Data Storage 📊

2. Data Organization and Classification 📂

3. Fast and Efficient Data Retrieval 🔍

4. Integration with Other Databases 🔗

5. Data Redundancy Handling ❌

6. Sequence Alignment and Analysis Tools 🧬

7. Standardized and Curated Data ✅

9. Regular Updates and Version Control 🔄

10. Support for Computational and AI-Based Research 🤖

Q)3) Classify different schemas of biological databases.

Database Type Example Database Description

Sequence GenBank (NCBI) A comprehensive database of publicly

Genome Ensembl (EBI) Provides genome annotations for various

Metabolic KEGG (Kyoto Encyclopedia of Contains information on metabolic pathways,

Chemical PubChem (NCBI) Provides data on chemical compounds,

Disease OMIM (Online Mendelian A database of human genes and genetic

Functional GO (Gene Ontology) Provides standardized functional annotations

Pathway Reactome A curated database of biological pathways in

○​ GenBank (NCBI): A comprehensive database of nucleotide sequences.

○​ KEGG (Kyoto Encyclopedia of Genes and Genomes): A database of metabolic pathways

○​ PubChem (NCBI): A database of chemical molecules.

○​ Gene Ontology (GO): A database of gene functions.

○​ Reactome: A database of biological pathways and processes.

Q)5) Explain different data retrieval tools for biological databases.

Different data retrieval tools for biological databases include:

1. Text Term Searching:

2. Sequence Similarity Searching:

●​ Taxonomy Browser: Searches NCBI’s taxonomy database.

4. Sequence Submission Tools:

5. SRS (Sequence Retrieval System):

●​ EMBL: A nucleotide sequence database.

An integrated database retrieval system with three main commands:

●​ bget: Retrieves database entries by identifier.

Challenges in Retrieving Data from Biological Databases

1. Data Redundancy and Duplication

●​ Multiple databases store the same sequence information, leading to redundancy.

2. Data Volume and Scalability

●​ Biological databases are growing rapidly due to advancements in sequencing technologies.

3. Data Format Variability

●​ Differences in annotation standards and classification systems across databases create

5. Data Retrieval Efficiency

●​ Searching large biological databases can be slow and resource-intensive.

6. Updating and Versioning Issues

7. Data Access and Licensing Restrictions

8. Integration of Heterogeneous Data Sources

●​ Large-scale data retrieval and analysis demand high-performance computing (HPC)

10. Biological Complexity and Data Interpretation

You might also like

1. Replication – DNA makes a copy of itself (DNA → DNA).

1. Genomics & Proteomics 🧬

1. Big Data Management 📊

● DNA as Information Storage:

● UniProt (Universal Protein Resource) – A comprehensive resource combining Swiss-Prot,

1. UniProt – Most comprehensive sequence and annotation database.

○ GenBank (NCBI): A comprehensive database of nucleotide sequences.

○ KEGG (Kyoto Encyclopedia of Genes and Genomes): A database of metabolic pathways

○ PubChem (NCBI): A database of chemical molecules.

○ Gene Ontology (GO): A database of gene functions.

○ Reactome: A database of biological pathways and processes.

● Taxonomy Browser: Searches NCBI’s taxonomy database.

● EMBL: A nucleotide sequence database.

● bget: Retrieves database entries by identifier.

● Multiple databases store the same sequence information, leading to redundancy.

● Biological databases are growing rapidly due to advancements in sequencing technologies.

● Differences in annotation standards and classification systems across databases create

● Searching large biological databases can be slow and resource-intensive.

● Large-scale data retrieval and analysis demand high-performance computing (HPC)