Introduction Bio Informatics Lecture Notes
Introduction Bio Informatics Lecture Notes
Kristen Amuzzini
1
© 2003 The MathWorks, Inc.
Presentation Layout
• Sequence analysis
• Base calling algorithm design, sequence alignment,
sequence building algorithms
• Microarray analysis
• Image processing, QA/QC, data normalization, data analysis
• Proteomics
• Mass Spectrometry signal processing, protein marker
identification and classification, peptide sequence
identification, 2D-Gel image analysis
• Systems Biology
• Interaction network identification, simulation of metabolic
pathways, flux analysis
• Bioinformatics Teaching
• MIT, Stanford, Cornell, Carnegie Mellon, …
• Research
• Sequencing
• Base calling algorithm design
• Sequence analysis
• Computational biolinguistics
• Microarray analysis
• Statistical modeling of microarrays
• Proteomics
• Statistical modeling of protein-protein interaction
• Systems Biology
• Flux Analysis
11
© 2003 The MathWorks, Inc.
The MathWorks Product Family
Integrated for:
technical computing, data analysis and visualization
system modeling and simulation
implementation of real-time embedded software
Blocksets
Code Generation
Stateflow
Stateflow
Toolboxes
PC-based real-time
systems
DAQ cards
Instruments Desktop Applications
Databases and files Automated Reports
Financial Datafeeds
• File I/O
• FASTA, PDB, SCF, GPR, GAL
• Web Connectivity
• GenBank, EMBL, PIR, PDB 212 PYESFTFPELMRKGSYNPVTHIYTAQDVKEVIEYARLRGIR
| | | :| | | : |: | : : : |: | | | : | |
| : | :: | ::
• Sequence Analysis & Alignment 321 PYISRYYPELAVHGAYSE -SETYSEQDVREVAEFAKIYGVQ
• Needleman-Wunsch, Smith-Waterman
• DNA/RNA/AA conversions, pattern searching
• Microarray Normalization & Visualization
• Lowess, global mean, MAD (median absolute deviation)
• Protein Visualization
• Atomic composition, molecular weight, hydrophobicity profile
Launchpad:
Start other tools and
demos
Command Window
Workspace
Browser:
See your data
Command
History
Reference:
© 2003
DeRisi, JL, Iyer, VR, Brown, PO. "Exploring the metabolic and genetic control of gene expression on a genomic scale." Science. 1997 Oct 24;278(5338):680-6. The MathWorks, Inc.
Integrating and Deploying
Integrating
Developing
and Deploying
and Deploying
Bioinformatics
Bioinformatics
Tools with
Bioinformatics Tools with MATLAB
ApplicationsMATLAB
with MATLAB
17
© 2003 The MathWorks, Inc.
Connecting to MATLAB
C/C++
Java Excel / COM
Perl
Database
Toolbox
Web
Instrument Control
Data Acquisition File I/O
Image Acquisition
Stand-alone Web
Data I/O
• Import Excel ranges
into MATLAB
• Export MATLAB data into
Excel ranges
• Evaluate MATLAB Statements in
Excel
MLPutMatrix("Genes",A2:A43)
MLPutMatrix("TimeSteps",B1:H1)
MLEvalString("clustergram(data,'RowLabels',…
Genes,'ColLabels',TimeSteps)")
© 2003 The MathWorks, Inc.
What else could you do?
23
© 2003 The MathWorks, Inc.
Industry Issues & Solutions
• MATLAB Central
– File exchange and newsgroup access for MATLAB and
Simulink users
– www.mathworks.com/matlabcentral
– Access to comp.soft-sys.matlab