genomic

SNP Filtering with pyGeno

Looking over the contents of our growing blog (good job, guys!), it occurred to me that we had not yet posted an article about pyGeno, a fantastic (and homegrown!) bioinformatics resource. It turns out I need to use pyGeno to generate data, and it's also my turn to write a blog post. How convenient! I'll focus the article on writing a SNP filter, which can be a bit surprising the first time you try [...]

December 9, 2016 | Categories: Bioinformatics, Python

Generating Synthetic Genomic Data

Applying statistical methods is a large part of a bioinformatician's work. Alongside more classical techniques, machine learning algorithms (notably clustering techniques such as k-means) are regularly applied to clinical and biological data. Some techniques, such as artificial neural networks, have recently found great success in areas such as image recognition and natural language processing. However, these techniques do not perform as well on small datasets with high dimensionality, a problem known as "the curse of dimensionality". [...]

January 7, 2016 | Categories: Bioinformatics, Python

Draw me a Circos

How pretty would that look in my article? Very pretty! And informative, too! You might want to use a Circos for your own personal analysis or as an article figure. In both cases, this kind of representation is useful for visualizing data in a more global or complete manner: you can display multiple types of data across various chromosomal sequences. However, as wonderful and exciting the idea of having your own personal Circos might [...]

August 20, 2015 | Categories: Data Visualization