<?xml version="1.0" encoding="iso-8859-1"?>
<!-- generator="FeedCreator 1.7.2" -->
<rss version="2.0">
    <channel>
        <title>MedWorm: Bioinformaticians</title>
        <description>MedWorm.com provides a medical RSS filtering service. Over 6000 RSS medical sources are combined and output via different filters. This feed contains the latest headlines from journals and sites in the Bioinformaticians category.</description>
        <link><![CDATA[http://www.medworm.com/blogs/index.php/Bioinformaticians/132/]]></link>
        <lastBuildDate>Wed, 01 Feb 2012 09:22:25 +0100</lastBuildDate>
        <item>
            <title>Inside the Variation Toolkit: Tools for Gene Ontology</title>
            <link>http://www.medworm.com/index.php?rid=5645851&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2012%2F01%2Finside-variation-toolkit-tools-for-gene.html</link>
            <description>GeneOntologyDbManager is a C++ tool that is part of my experimental Variation Toolkit.
This program is a set of tools for GeneOntology, it is based on the sqlite3 library.




Download
Download the sources from Google-Code using subversion:....
svn checkout http://variationtoolkit.googlecode.com/svn/trunk/ variationtoolkit-read-only
... or update the sources of an existing installation... 
cd (Source: YOKOFAKUN)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5645851</comments>
            <pubDate>Tue, 31 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5645851</guid>        </item>
        <item>
            <title>Does Illlmina Also Have A Homopolymer Problem?</title>
            <link>http://www.medworm.com/index.php?rid=5645850&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2012%2F01%2Fdoes-illlmina-also-have-homopolymer.html</link>
            <description>One of the most widely-publicized error modes with Ion Torrent and 454 sequencing has been the challenge of correctly counting the number of bases in homopolymer runs.  Because these chemistries use non-terminating nucleotides, polymerase is free to add as many as possible.  Unfortunately, the signal linearity breaks down, making it difficult to correctly count.  Ion Torrent today released a note on homopolymers, but rather than plowing this well-trod ground it goes for a less publicized problem: Illumina having a more specific challenge in this department.  The note is available on the Ion Community, free registration required.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5645850</comments>
            <pubDate>Mon, 30 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5645850</guid>        </item>
        <item>
            <title>Inside the variation toolkit: VCF2XML</title>
            <link>http://www.medworm.com/index.php?rid=5637498&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2012%2F01%2Finside-variation-toolkit-vcf2xml.html</link>
            <description>vcf2xml is C++ tool that is part of my Variation Toolkit.
It transforms a &quot;Variant Call Format document&quot; to XML, so it can be later processed with xslt, xquery, etc...









Dependencies

libxml http://xmlsoft.org/


Download
Download the sources from Google-Code using subversion:....
svn checkout http://variationtoolkit.googlecode.com/svn/trunk/ variationtoolkit-read-only
... or update the (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5637498</comments>
            <pubDate>Sat, 28 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5637498</guid>        </item>
        <item>
            <title>Insert your VCFs in a sqlite database.</title>
            <link>http://www.medworm.com/index.php?rid=5637499&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2012%2F01%2Finsert-your-vcfs-in-sqlite-database.html</link>
            <description>vcf2sqlite is C++ tool that is part of my Variation Toolkit.
It inserts a &quot;Variant Call Format document&quot; (VCF) into a sqlite3 database.







Download
Download the sources from Google-Code using subversion:....
svn checkout http://variationtoolkit.googlecode.com/svn/trunk/ variationtoolkit-read-only
... or update the sources of an existing installation... 
cd variationtoolkit
svn update
... and (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5637499</comments>
            <pubDate>Sat, 28 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5637499</guid>        </item>
        <item>
            <title>Wiider postmortem</title>
            <link>http://www.medworm.com/index.php?rid=5637497&amp;cid=d_132_132_f&amp;fid=35021&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2FYourBonesGotALittleMachine%2F%7E3%2FObjIAN3kXyo%2F</link>
            <description>I always intended to write this postmortem earlier &amp;#8230; now three years after development ceased, I&amp;#8217;m finally getting around to it. Warning &amp;#8211; retrospective rambling ahead.
In mid 2007, Nintendo released the Opera-powered browser for their Wii gaming console which they called the Internet Channel. For many people, including myself, this was the first time they had been able to use &amp;#8220;Internet on the TV&amp;#8221;. Because of the typical viewing distance, low resolution for CRT-based televisions, and the unique navigation interface using the Wiimote, many web sites were functional but not particularly comfortable to use. Many sites targeted at desktop PCs were too complex and heavyweight for the Internet Channel, fonts were often too small such that cumbersome zooming and scr...</description>
            <author>Your bones got a little machine.</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5637497</comments>
            <pubDate>Sat, 28 Jan 2012 03:00:54 +0100</pubDate>
            <guid isPermaLink="false">5637497</guid>        </item>
        <item>
            <title>de-Bruijn assembler</title>
            <link>http://www.medworm.com/index.php?rid=5637500&amp;cid=d_132_132_f&amp;fid=35036&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fblogspot%2FLuPcC%2F%7E3%2FVbXmOaUEtog%2Feuler.html</link>
            <description>In the last post&amp;nbsp;I talked about the overlap-layout-consensus (OLC) way of Genome assembly. The approach which is (really!) getting popular these days is the other one, de-Bruijn-graphs (DBG). It is based on the simple idea of converting the hard problem of finding&amp;nbsp;Hamiltonian&amp;nbsp;to relatively simpler&amp;nbsp;Eulerian&amp;nbsp;for assembling biological sequences. There is a beautiful tutorial like introduction co-authored by the 'father' of this idea, Prof. Pevzner.

To put it naively, &amp;nbsp;in contrast with OLC approach where one represents the reads as vertex in graph which are connected if the end of the read representing the first vertex overlaps with the start of the second one, DBG&amp;nbsp;approach has the overlapping parts of the reads as&amp;nbsp;the vertices and edges representing th...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Bioinformatics Latest News</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5637500</comments>
            <pubDate>Fri, 27 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5637500</guid>        </item>
        <item>
            <title>Reproducible research: three links that made me think</title>
            <link>http://www.medworm.com/index.php?rid=5637496&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2012%2F01%2F27%2Freproducible-research-three-links-that-made-me-think%2F</link>
            <description>I&amp;#8217;m constantly amazed, bemused and troubled by how little published scientific research is genuinely reproducible, in that you or I (or even the original authors) could go back and check the results. Three examples from around the Web converged in my mind this week.

Software availability
A BioStar user asks: where is the software for the method described in a Science article, &amp;#8220;A Composite of Multiple Signals Distinguishes Causal Variants in Regions of Positive Selection.&amp;#8221;
No-one can find it on the Web; the best we can do is a press release from 2010 stating that the software &amp;#8220;should soon be available.&amp;#8221; Why do the so-called flagship journals and their reviewers have so little interest in the methods used to generate data shown in papers? It&amp;#8217;s beyond my c...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5637496</comments>
            <pubDate>Fri, 27 Jan 2012 01:20:45 +0100</pubDate>
            <guid isPermaLink="false">5637496</guid>        </item>
        <item>
            <title>Roche Guns For Illumina</title>
            <link>http://www.medworm.com/index.php?rid=5637495&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2012%2F01%2Froche-guns-for-illumina.html</link>
            <description>Due to a business dinner &amp; general exhaustion, I turned in early last night &amp; was caught unaware this morning of the big news: Roche is making a hostile takeover bid for Illumina.  Ugh!Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5637495</comments>
            <pubDate>Wed, 25 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5637495</guid>        </item>
        <item>
            <title>Science Online 2012 on Twitter</title>
            <link>http://www.medworm.com/index.php?rid=5627099&amp;cid=d_132_132_f&amp;fid=35015&amp;url=http%3A%2F%2Fnftb.net%2F%3Fp%3D109</link>
            <description>Science Online 2012, held in at the NC state university in Raleigh, NC, last week was a transforming event. The 450 attendees were a colorful mix of science writers, journalists, researchers, educators and artists. Apparently, the ratio of scientists dropped in its 6th installment. But the love of Science in the air would have graced any proper scientific meeting. Much of it has already been praised in readable form and can be accessed in a wiki repository. So why was Science Online 2012 so great? The reliable WLAN, the conference hall, the sessions run by the usual suspects, more or less prepared? I fail to answer that question precisely but coming back  I realized I changed my views on many issues in science communication profoundly. When I wrote it up, it sounded as shallow as th...</description>
            <author>Notes from the biomass</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5627099</comments>
            <pubDate>Mon, 23 Jan 2012 22:36:29 +0100</pubDate>
            <guid isPermaLink="false">5627099</guid>        </item>
        <item>
            <title>Sequencing Technology Fireworks</title>
            <link>http://www.medworm.com/index.php?rid=5596174&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2012%2F01%2Fsequencing-technology-fireworks.html</link>
            <description>I actually awoke today expecting an exciting press release, but I sure wasn&amp;#39;t prepared for the big announcements from Ion and Illumina.  Not that they were totally unexpected, but there&amp;#39;s a huge difference between speculation and announced products (which, of course, are hugely different from ones you can actually buy!)Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5596174</comments>
            <pubDate>Tue, 10 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5596174</guid>        </item>
        <item>
            <title>A CGI-version of samtools tview.</title>
            <link>http://www.medworm.com/index.php?rid=5571389&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2012%2F01%2Fcgi-version-of-samtools-tview.html</link>
            <description>I've created a lightweight CGI-based web-application for samtools tview. This C++ program named ngsproject.cgi uses the samtools api, it allows any user to visualize all the alignments in a given NGS project. The projects and their BAMS are defined on the server side using a simple XML document. e.g: 


 
 /home/lindenb/samtools-0.1.18/ (Source: YOKOFAKUN)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5571389</comments>
            <pubDate>Sat, 07 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5571389</guid>        </item>
        <item>
            <title>Ion Torrent Pairs: To What End?</title>
            <link>http://www.medworm.com/index.php?rid=5596175&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2012%2F01%2Fion-torrent-pairs-to-what-end.html</link>
            <description>Ion Torrent quietly released a set of paired end datasets over the holiday break.  This is a bit embarassing for me, as in my last post on Ion I stated the platform &amp;quot;will probably never have paired ends&amp;quot; and in fact Ion had already announced the protocol.  Oy!  I also missed their mate pair protocol being released, though the document itself is another victim of Ion&amp;#39;s incredibly counterproductive security policy.  If you don&amp;#39;t own a PGM, you can&amp;#39;t access the document -- never mind if you are trying to plan for a potential buy or are preparing a library for a friend/collaborator to run.  Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5596175</comments>
            <pubDate>Sat, 07 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5596175</guid>        </item>
        <item>
            <title>The Variation Toolkit</title>
            <link>http://www.medworm.com/index.php?rid=5571390&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2012%2F01%2Fvariation-toolkit.html</link>
            <description>During the last weeks, I've worked on an experimental C++ package named The Variation Toolkit (varkit). It was originally designed to provide some command lines equivalent to knime4bio but I've added more tools over time. Some of those tools are very simple-and-stupid ( fasta2tsv) , reinvent the wheel (&quot;numericsplit&quot;), are part of an answer to biostar, are some old tools (e.g. bam2wig) that have (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5571390</comments>
            <pubDate>Thu, 05 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5571390</guid>        </item>
        <item>
            <title>Hamiltonian Assembler</title>
            <link>http://www.medworm.com/index.php?rid=5571391&amp;cid=d_132_132_f&amp;fid=35036&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fblogspot%2FLuPcC%2F%7E3%2F2vQKpKnzJHU%2Fhamiltonian-path-assembler.html</link>
            <description>Just came across Andrew's Hamiltonian Cycle finder (HCF)

&amp;nbsp;get: 


&amp;nbsp;wget https://raw.github.com/ahh/ahh-toys/master/ham.sh


&amp;nbsp;simple check:


&amp;nbsp;bash ham.sh &amp;nbsp;a b&amp;nbsp;b c&amp;nbsp;c a &amp;nbsp;[followed by Ctrl-D should produce] 
&amp;nbsp;a b c

based purely on shell commands and thought of testing it out as a Genome assembler. The genome assembly problem is closely related to finding the shortest common superstring (S) of a given set of strings (s1, s2… sn). The superstring S corresponds to the genome and the set of strings being the short sequence reads produced by the sequencing machines.

The popular ways of approaching this problem are OLC and DBG (overlap-layout-consensus&amp;nbsp; and de-bruijn-graph). OLC approach is similar to HCF or the popular textbook example, the tr...</description>
            <author>Bioinformatics Latest News</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5571391</comments>
            <pubDate>Thu, 05 Jan 2012 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5571391</guid>        </item>
        <item>
            <title>2011 blog stats courtesy of WordPress.com</title>
            <link>http://www.medworm.com/index.php?rid=5558603&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2012%2F01%2F01%2F2011-blog-stats-courtesy-of-wordpress-com%2F</link>
            <description>The kind people at WordPress.com have prepared a 2011 annual report for this blog.


 
  Click here to see the complete report.
 
Filed under: this blog Tagged: 2011, summary, wordpress.com (Source: What You're Doing Is Rather Desperate)</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5558603</comments>
            <pubDate>Sun, 01 Jan 2012 00:14:16 +0100</pubDate>
            <guid isPermaLink="false">5558603</guid>        </item>
        <item>
            <title>A Fundamental Breakthrough in Protein Folding</title>
            <link>http://www.medworm.com/index.php?rid=5551449&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fprotein%2Fa-fundamental-breakthrough-in-protein-folding</link>
            <description>In my humble opinion, the biggest paper in protein folding from the last few years just got published in the wee hours of 2011. It is Protein 3D Structure Computed from Evolutionary Sequence Variation from Debora Marks, Lucy Colwell and colleagues (and when I say colleague I mean Chris Sander, which you should all know as a co-author of DSSP). This paper proves the tremendous result that the key structural contacts in a protein structure can be derived from a multiple sequence alignment. And that these contacts are sufficient to generate reliable structures of the protein. And big proteins at that. 

	I heard on the grapevine that this paper had a difficult passage to publication. I was both surprised and not surprised by this. I was surprised because this is such a fundamental result, it ...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5551449</comments>
            <pubDate>Wed, 28 Dec 2011 06:13:02 +0100</pubDate>
            <guid isPermaLink="false">5551449</guid>        </item>
        <item>
            <title>Year's End</title>
            <link>http://www.medworm.com/index.php?rid=5551448&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F12%2Fyears-end.html</link>
            <description>I hoped this year to push myself to blog more frequently and regularly.  Clearly I did better than some years, but not up to the standard I had hoped for.  I&amp;#39;ve also realized that I missed noting some significant personal milestones.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5551448</comments>
            <pubDate>Wed, 28 Dec 2011 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5551448</guid>        </item>
        <item>
            <title>Sequencing for relics from the Sanger era part 1: getting the raw data</title>
            <link>http://www.medworm.com/index.php?rid=5536345&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F12%2F22%2Fsequencing-for-relics-from-the-sanger-era-part-1-getting-the-raw-data%2F</link>
            <description>Sequencing in the good old daysIn another life, way back in the mists of time, I did a Ph.D. Part of my project was to sequence a gene from a bacterium, which encoded an enzyme involved in nitrate metabolism. It took the best part of a year to obtain ~ 2 000 bp of DNA sequence: partly because I was rubbish at sequencing, but also because of the technology at the time. It was an elegant biochemical technique called the dideoxy chain termination method, or &amp;#8220;Sanger sequencing&amp;#8221; after its inventor. Sequence was visualized by exposing radioactively-labelled DNA to X-ray film, resulting in images like the one at left, from my thesis. Yes, that photograph is glued in place. The sequence was read manually, by placing the developed film on a light box, moving a ruler and writing down the...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5536345</comments>
            <pubDate>Thu, 22 Dec 2011 04:28:44 +0100</pubDate>
            <guid isPermaLink="false">5536345</guid>        </item>
        <item>
            <title>Books Read 2011</title>
            <link>http://www.medworm.com/index.php?rid=5522665&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fbooks%2Fbooks-read-2011</link>
            <description>I started this year with the good intention of writing a paragraph or three for each book I read. A short review so to speak. I managed for about 15 books up to April. It&amp;#8217;s too much work to write about what I just read. Easier to read something new. So here&amp;#8217;s my reading list for this year (managed to read a lot of technical shit):

	1. Stendhal, The Red and the Black
2. Steig Larson, The Girl who kicked the Hornet’s Nest
3. Neil Howe and William Strauss, The Fourth Turning
5. Edith Wharton, “Age of Innocence”.
6. Gary Taubes, “Why we get Fat”
7. Nathan Haren &amp; Mike Cliffe Jones, “Beyond Blogging”
8. Jules Verne, “Voyage au Centre du Monde”
9. Rino Breerbaart, “Song Logic”
11. Mario Vargas Llosa, “Aunt Julia and the Scriptwriter”
12. Henry James, ...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5522665</comments>
            <pubDate>Mon, 19 Dec 2011 22:30:34 +0100</pubDate>
            <guid isPermaLink="false">5522665</guid>        </item>
        <item>
            <title>Magic Numbers and unit conversions in Structural Biology</title>
            <link>http://www.medworm.com/index.php?rid=5503449&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fprotein%2Fmagic-numbers-and-unit-conversions-in-structural-biology</link>
            <description>If you end up doing any kind of energy calculation in proteins or organic chemistry &amp;#8211; and that includes messing around with Molecular Dynamic trajectories &amp;#8211; you may end up dealing with actual numbers. 

	And that means you&amp;#8217;ll have to get your head around physical units and their conversions. 

	I&amp;#8217;ve spent days trying to figure out magic numbers in equations and source-code. Diving into the guts of someone else&amp;#8217;s source-code is not the nicest place to figure such things out. Do it enough, and you&amp;#8217;ll start seeing the same numbers pop up everywhere. As I&amp;#8217;ve never seen anyone bother to describe some very common magic numbers in biochemistry, I&amp;#8217;ll list a bunch of them right here. 

	First let&amp;#8217;s get acquainted with the standard units:

	
		po...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5503449</comments>
            <pubDate>Wed, 14 Dec 2011 21:00:55 +0100</pubDate>
            <guid isPermaLink="false">5503449</guid>        </item>
        <item>
            <title>#arseniclife: the genome</title>
            <link>http://www.medworm.com/index.php?rid=5503448&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F12%2F14%2Farseniclife-the-genome%2F</link>
            <description>It&amp;#8217;s about one year since the science story dubbed #arseniclife hit the headlines. November 30th saw the release of a draft genome sequence for Halomonas sp. GFAJ-1, the bacterium behind the furore.
As Iddo pointed out on Twitter, sequencing the DNA from GFAJ-1 is itself strong evidence against arsenate in the DNA backbone, since the sequencing chemistry would be highly unlikely to work in that case. However, if like me you think that a new microbial genome provides the most fun to be had in bioinformatics [*], you&amp;#8217;ll be excited by the availability of the data.
In this post then: where to get it, some very preliminary analysis and some things that you might like to to with it. Projects for your students, perhaps.
[*] note to self: why, then, am I working on colorectal cancer?

...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5503448</comments>
            <pubDate>Wed, 14 Dec 2011 04:35:32 +0100</pubDate>
            <guid isPermaLink="false">5503448</guid>        </item>
        <item>
            <title>Reflecting on a Year of Ion Torrent</title>
            <link>http://www.medworm.com/index.php?rid=5493763&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F12%2Freflecting-on-year-of-ion-torrent.html</link>
            <description>Ion Torrent released three more datasets this morning, all generated on their 318 chip.  One&amp;#39;s from E.coli but two are human genomic samples.  With approximately 1.2Gbp of raw data coming from these 318 chips (fron around 6 million quality filtered reads per chip), they are starting to move up the food chain in human genomics from pure amplicon sequencing to more complex small targeted resequencing efforts.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5493763</comments>
            <pubDate>Fri, 09 Dec 2011 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5493763</guid>        </item>
        <item>
            <title>New Look, Mobile Friendly</title>
            <link>http://www.medworm.com/index.php?rid=5485633&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fpost%2Fnew-look-mobile-friendly</link>
            <description>I&amp;#8217;ve been meaning to do this for a while, I&amp;#8217;ve updated the look of the blog. 

	The previous design was inspired by an old design of &amp;uuml;ber minimalist Ev Bogue. Back then I was wandering around the world as a nomadic minimalist. 

	I&amp;#8217;ve since been folded back in academia and it&amp;#8217;s time for a change. The new look has two goals:

	1. Emphasise the article nature of most of my (longish) posts
2. Responsive-Web-Design

	Resize the window and make it real narrow. You will see the design collapse into a linear layout. Perfect for reading on a mobile&amp;dagger; device.

	&amp;dagger;I really mean iPhones, but I need to stay in the good books of @pansapien. (Source: Trapped in the USA)</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5485633</comments>
            <pubDate>Thu, 08 Dec 2011 00:41:04 +0100</pubDate>
            <guid isPermaLink="false">5485633</guid>        </item>
        <item>
            <title>A Friday round-up</title>
            <link>http://www.medworm.com/index.php?rid=5467849&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F12%2F02%2Fa-friday-round-up%2F</link>
            <description>Just a brief selection of items that caught my eye this week. Note that this is a Friday as opposed to Friday, lest you mistake this for a new, regular feature.
1. R/statistics

ggbio
A new Bioconductor package which builds on the excellent ggplot graphics library, for the visualization of biological data.
R development master class
Hadley Wickham recently presented this course on R package development for my organisation. I was on parental leave at the time, otherwise I would have attended for sure.

2. Bioinformatics in the media
DNA Sequencing Caught in Deluge of Data
I described this NYT article as a &amp;#8220;surprisingly-good intro article&amp;#8220;. Michael Eisen described it as &amp;#8220;kind of silly&amp;#8220;.
I think we&amp;#8217;re both right. Michael&amp;#8217;s perspective is that of an expert i...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5467849</comments>
            <pubDate>Thu, 01 Dec 2011 23:36:06 +0100</pubDate>
            <guid isPermaLink="false">5467849</guid>        </item>
        <item>
            <title>Suggest some new terms for the EDAM Ontology for Bioinformatics</title>
            <link>http://www.medworm.com/index.php?rid=5467850&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F12%2Fsuggest-some-new-terms-for-edam.html</link>
            <description>EDAM is an ontology of general bioinformatics concepts, including topics and data types, formats, identifiers and operations.
Is your specific subject of research present in this ontology (e.g &quot;RNA-Seq&quot;) ? go and have a look at http://www.ebi.ac.uk/ontology-lookup/browse.do?ontName=EDAM. If it is not, feel free to suggest a new term in the form below. Your term might be included in the next (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5467850</comments>
            <pubDate>Thu, 01 Dec 2011 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5467850</guid>        </item>
        <item>
            <title>Special Topic Issue &quot;Webscience in medicine&quot;</title>
            <link>http://www.medworm.com/index.php?rid=5455993&amp;cid=d_132_132_f&amp;fid=39192&amp;url=http%3A%2F%2Fmeco-project.blogspot.com%2F2011%2F11%2Fspecial-topic-issue-webscience-in.html</link>
            <description>A Call for papers has been released for a special topic issue on Webscience in medicine. The special topic will appear within the Journal &quot;Methods of information in Medicine&quot;. (Source: Medical Ecosystem for Personalized Event-Based Surveillance)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Medical Ecosystem for Personalized Event-Based Surveillance</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5455993</comments>
            <pubDate>Tue, 29 Nov 2011 08:05:00 +0100</pubDate>
            <guid isPermaLink="false">5455993</guid>        </item>
        <item>
            <title>Boring, monotonous day-to-day tasks? That’s synonymous with bioinformatics.</title>
            <link>http://www.medworm.com/index.php?rid=5439628&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F11%2F24%2Fboring-monotonous-day-to-day-tasks-thats-synonymous-with-bioinformatics%2F</link>
            <description>In response to this question, I can only point out that J.C.R. Licklider figured it out over 50 years ago:
Despite the fact that there is a voluminous literature on thinking and problem solving, including intensive case-history studies of the process of invention, I could find nothing comparable to a time-and-motion-study analysis of the mental work of a person engaged in a scientific or technical enterprise. In the spring and summer of 1957, therefore, I tried to keep track of what one moderately technical person actually did during the hours he regarded as devoted to work. Although I was aware of the inadequacy of the sampling, I served as my own subject.
It soon became apparent that the main thing I did was to keep records, and the project would have become an infinite regress if the ke...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5439628</comments>
            <pubDate>Thu, 24 Nov 2011 05:18:38 +0100</pubDate>
            <guid isPermaLink="false">5439628</guid>        </item>
        <item>
            <title>Processing json data with apache velocity.</title>
            <link>http://www.medworm.com/index.php?rid=5430013&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F11%2Fprocessing-json-data-with-apache.html</link>
            <description>I've written a tool named &quot;apache velocity&quot; which parse json data and processes it with &quot;Apache velocity&quot; (a template engine ). The (javacc) source code is available here:


https://github.com/lindenb/jsandbox/blob/master/src/sandbox/VelocityJson.jj







Example
Say you have defined some classes using JSON:

[
 {
  &quot;type&quot;: &quot;record&quot;,
  &quot;name&quot;: &quot;Exon&quot;,
  &quot;fields&quot; : [
   {&quot;name&quot;: &quot;start&quot; (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5430013</comments>
            <pubDate>Sun, 20 Nov 2011 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5430013</guid>        </item>
        <item>
            <title>OSGi and Eclipse RCP Talk</title>
            <link>http://www.medworm.com/index.php?rid=5420819&amp;cid=d_132_132_f&amp;fid=35017&amp;url=http%3A%2F%2Feric.jain.name%2F2011%2F11%2F17%2Fosgi-and-eclipse-rcp-talk%2F</link>
            <description>I gave a talk at the Seattle Java User Group this week, where I talk about two technologies I use at work: The OSGi module system, and the Eclipse Rich Client Platform. Giving this talk was a good experience, and the audience was great. You can watch the talk below. (Source: eric.jain.name)</description>
            <author>eric.jain.name</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5420819</comments>
            <pubDate>Thu, 17 Nov 2011 22:50:40 +0100</pubDate>
            <guid isPermaLink="false">5420819</guid>        </item>
        <item>
            <title>&quot;VCF annotation&quot; with the NHLBI GO Exome Sequencing Project (JAX-WS)</title>
            <link>http://www.medworm.com/index.php?rid=5420820&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F11%2Fvcf-annotation-with-nhlbi-go-exome.html</link>
            <description>The NHLBI Exome Sequencing Project (ESP) has released a web service to query their data. &quot;The goal of the NHLBI GO Exome Sequencing Project (ESP) is to discover novel genes and mechanisms contributing to heart, lung and blood disorders by pioneering the application of next-generation sequencing of the protein coding regions of the human genome across diverse, richly-phenotyped populations and to (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5420820</comments>
            <pubDate>Wed, 16 Nov 2011 05:00:00 +0100</pubDate>
            <guid isPermaLink="false">5420820</guid>        </item>
        <item>
            <title>The infidelity of the theoretical protein backbone</title>
            <link>http://www.medworm.com/index.php?rid=5420821&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fprotein%2Fthe-infidelity-of-the-theoretical-protein-backbone</link>
            <description>Problems in protein simulations are often reduced to one of two categories:
	
		do we have a sufficiently accurate model of atomic interactions?
		have we sufficiently explored the conformational space of our proteins?
	

	If you talk to molecular modellers, they will try to tell you the problem lies wholly in conformational sampling. It is an article of their faith that force-fields are good enough. Our erstwhile modeller can then claim that the problem lies not in software but in hardware. It&amp;#8217;s just that their computers are too slow to finish their simulations properly.

	Well, I beg to differ. One major flaw in atomic force-fields, which has been swept under the carpet, is that atomic force-fields fail to properly model the protein backbone. And they&amp;#8217;ve failed to do so, for ...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5420821</comments>
            <pubDate>Tue, 15 Nov 2011 20:23:07 +0100</pubDate>
            <guid isPermaLink="false">5420821</guid>        </item>
        <item>
            <title>Some English pronunciation tips</title>
            <link>http://www.medworm.com/index.php?rid=5405643&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fperformance%2Fsome-english-pronounciation-tips</link>
            <description>Although I learnt English as a second language, I now wear it like a well-worn glove. But in science, there are tons of made-up poly-syllabic words, and I occassionally trip over some of them, such as: equilibrate, equilibrium and equilibration. 

	However, I&amp;#8217;ve been learning a bunch of languages the last few years, and through doing that, it&amp;#8217;s thrown English pronunciation in relief. I&amp;#8217;m starting to get a conscious handle on English pronunciation, and I&amp;#8217;ve discovered some useful rules that were not apparent to me before:

	
		English is an accented language. There is one emphasized syllable in every polysyllabic word, unlike say, Chinese or French where there are no syllable stresses. In Chinese each syllable is delineated quite clearly, giving a staccato feel, wher...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5405643</comments>
            <pubDate>Mon, 14 Nov 2011 11:59:06 +0100</pubDate>
            <guid isPermaLink="false">5405643</guid>        </item>
        <item>
            <title>Some English pronounciation tips</title>
            <link>http://www.medworm.com/index.php?rid=5399346&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fperformance%2Fsome-english-pronounciation-tips</link>
            <description>Although I learnt English as a second language, I now wear it like a well-worn glove. But in science, there are tons of made-up poly-syllabic words, and I occassionally trip over some of them, such as: equilibrate, equilibrium and equilibration. 

	However, I&amp;#8217;ve been learning a bunch of languages the last few years, and through doing that, it&amp;#8217;s thrown English prounciation in relief. I&amp;#8217;m starting to get a conscious handle on English prounciation, and I&amp;#8217;ve discovered some useful rules that were not apparent to me before:

	
		English is an accented language. There is one emphasized syllable in every polysyllabic word, unlike say, Chinese or French where there are no syllable stresses. In Chinese each syllable is delineated quite clearly, giving a staccato feel, wherea...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5399346</comments>
            <pubDate>Thu, 10 Nov 2011 06:55:47 +0100</pubDate>
            <guid isPermaLink="false">5399346</guid>        </item>
        <item>
            <title>MedEx 2011</title>
            <link>http://www.medworm.com/index.php?rid=5399347&amp;cid=d_132_132_f&amp;fid=39192&amp;url=http%3A%2F%2Fmeco-project.blogspot.com%2F2011%2F11%2Fmedex-2011.html</link>
            <description>On October 28, 2011 the Second International Workshop on Webscience and Information Exchange (MedEx 2011) took place in Glasgow. The audience had the opportunity to learn more about the facets and research questions related to webscience in medicine and healthcare. Three full papers and five position papers were presented covering a wide spectrum of research problems in medical natural language processing and webscience. Among others, two papers from the M-Eco context have been introduced.In her keynote, Wendy Chapman provided insights into her vision of enabling collaboration and sharing in clinical natural language processing. The second keynote was held by Sophia Ananiadou on advances of biomedical text mining for semantic search. The position papers provided lots of discussion points s...</description>
            <author>Medical Ecosystem for Personalized Event-Based Surveillance</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5399347</comments>
            <pubDate>Tue, 08 Nov 2011 08:10:00 +0100</pubDate>
            <guid isPermaLink="false">5399347</guid>        </item>
        <item>
            <title>The paper about BioStar has been published in &quot;PLoS Computational Biology&quot;</title>
            <link>http://www.medworm.com/index.php?rid=5381272&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F11%2Fpaper-about-biostar-has-been-published.html</link>
            <description>The article describing BioStar has been published in PLoS Computational Biology:





BioStar: An Online Question &amp; Answer Resource for the Bioinformatics Community

Laurence D. Parnell, Pierre Lindenbaum, Khader Shameer, Giovanni Marco Dall'Olio, Daniel C. Swan, Lars Juhl Jensen, Simon J. Cockell, Brent S. Pedersen, Mary E. Mangan, Christopher A. Miller, Istvan Albert. 2011
PLoS Comput Biol 7(10 (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5381272</comments>
            <pubDate>Tue, 01 Nov 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5381272</guid>        </item>
        <item>
            <title>Fitting Out</title>
            <link>http://www.medworm.com/index.php?rid=5357879&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F10%2Ffitting-out.html</link>
            <description>One of the attractions of my new shop was the possibility to see a biotech company built from the ground up.  Each of my previous companies had been a long-standing concern by the time I got there; even Codon Devices had a year plus under its belt and some equipment already mothballed.  The new venture moved into its first lab space last week, and as you can see from the picture all we have at the moment there are bare walls.Read more » (Source: Omics! Omics!)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5357879</comments>
            <pubDate>Thu, 27 Oct 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5357879</guid>        </item>
        <item>
            <title>A reference genome with or without the 'chr' prefix</title>
            <link>http://www.medworm.com/index.php?rid=5345712&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F10%2Freference-genome-with-or-without-chr.html</link>
            <description>The name of the chromosomes in the fasta files for the human genome are prefixed with 'chr' :
$ grep &quot;&gt;&quot; hg19.fa 
&gt;chr1
&gt;chr2
&gt;chr3
&gt;chr4
&gt;chr5
&gt;chr6
(...)
The FAIDX index for this fasta file looks like this:
chr1 249250621 6 50 51
chr2 243199373 254235646 50 51
chr3 198022430 502299013 50 51
chr4 191154276 704281898 50 51
chr5 180915260 899259266 50 51
chr6 171115067 1083792838 50 51
(...)
. (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5345712</comments>
            <pubDate>Fri, 21 Oct 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5345712</guid>        </item>
        <item>
            <title>Running a local JABAWS server for Jalview on Ubuntu (11.04 Natty)</title>
            <link>http://www.medworm.com/index.php?rid=5312946&amp;cid=d_132_132_f&amp;fid=35021&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2FYourBonesGotALittleMachine%2F%7E3%2FqhgexDm9lp4%2F</link>
            <description>The excellent Jalview sequence alignment visualization and editing tool has the ability to send a set of sequences to a multiple sequence alignment web service (&amp;#8220;JABAWS&amp;#8221;) and receive the results in a new alignment window. This is really convenient when you are doing lots of sequence analysis, and Geoff Barton&amp;#8217;s group at the University of Dundee provide a JABAWS server that Jalview will use by default.
But maybe the Dundee server is down. Or maybe you think your local machine will do things faster. Or maybe you work on über secret sequences in some Faraday cage bunker with no permanent network connection. In each of these cases, you may want to run your own local JABAWS server and use that instead. In this case, read on.

Download the JABAWS war file (direct link here)...</description>
            <author>Your bones got a little machine.</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5312946</comments>
            <pubDate>Fri, 14 Oct 2011 03:52:52 +0100</pubDate>
            <guid isPermaLink="false">5312946</guid>        </item>
        <item>
            <title>MiSeq Made Easy?</title>
            <link>http://www.medworm.com/index.php?rid=5322968&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F10%2Fmiseq-made-easy.html</link>
            <description>The first computer I ever tried to program was built from a kit by my brother and father.  The DATAC-1000 was a single-board machine, with that single printed circuit board about the area of a large laptop (image on page 9). Sporting a grand 1K of RAM, it was a grand machine.  User input-output was entirely through a set of binary touchpads and LEDs, though a cassette tape interface enabled storing and reading programs.  If I helped any with it, I might have sorted the resistors since I had just learned the color code.  The machine sported the same processor as some other machines of the time, such as the KIM-1 and the PET and even something called an Apple.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5322968</comments>
            <pubDate>Tue, 11 Oct 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5322968</guid>        </item>
        <item>
            <title>Swimming, Running, Hunting, and Meditating: the Evolutionary Origin of Mystical States</title>
            <link>http://www.medworm.com/index.php?rid=5303133&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fbooks%2Fswimming-running-hunting-and-meditating-the-evolutionary-origin-of-mystical-states</link>
            <description>I&amp;#8217;ve wondered often at how mystical states came about, states of mind that takes us out of the every day, such as the samadhi of buddhist meditation, or the zone of the long-distance runner. Assume, for argument sake, that such mystical states are an intrinsic part of our human heritage, then they must have evolved from a mental substratum dating back to a more primordial existence. The question then arises as to what possible use our distant quasi-monkey ancestors might have had for mystical-like states or mind. 

	Or to rephrase it another way, is there some kind of biological function that would require the evolution of mystical-like state of mind?

	To explore this, we need a reasonable map of the evolution of humans from our common ancestor with other apes. First though, we need...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5303133</comments>
            <pubDate>Mon, 10 Oct 2011 12:27:57 +0100</pubDate>
            <guid isPermaLink="false">5303133</guid>        </item>
        <item>
            <title>Knime4Bio: a set of custom nodes for the interpretation of NGS data with KNIME</title>
            <link>http://www.medworm.com/index.php?rid=5295795&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F10%2Fknime4bio-set-of-custom-nodes-for.html</link>
            <description>Our paper has just been published in Bioinformatics  :-)

http://bioinformatics.oxfordjournals.org/content/early/2011/10/07/bioinformatics.btr554.abstract





Knime4Bio: a set of custom nodes for the interpretation of Next Generation Sequencing data with KNIME.

   Pierre Lindenbaum, Solena Le Scouarnec, Vincent Portero and Richard Redon








Summary: Here, we describe Knime4Bio, a set of (Source: YOKOFAKUN)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5295795</comments>
            <pubDate>Fri, 07 Oct 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5295795</guid>        </item>
        <item>
            <title>Grouping mutations/Gene=f(sample)</title>
            <link>http://www.medworm.com/index.php?rid=5295796&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F10%2Fgrouping-mutationsgenefsample.html</link>
            <description>GroupByGene is a small C++ tool grouping the data:

CHROM
POS
REF
GENE
SAMPLE

by gene=f(sample). This tool is available on github:https://github.com/lindenb/ccsandbox/blob/master/src/groupbygene.cpp.
Example:
$ cat input.tsv

#CHROM	POS	REF	ALT	GENE	SAMPLE	
chr1	10	A	T	gene1	indi1
chr1	10	A	T	gene1	indi2
chr1	11	C	G	gene1	indi2
chr2	110	C	G	gene2	indi3
chr3	210	A	T	gene3	indi1
chr3	211	C	T	gene3 (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5295796</comments>
            <pubDate>Wed, 05 Oct 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5295796</guid>        </item>
        <item>
            <title>Verticalize: printing the input stream vertically.</title>
            <link>http://www.medworm.com/index.php?rid=5295797&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F10%2Fverticalize-printing-input-stream.html</link>
            <description>A useful tool: verticalize is a small C++ tool printing the input stream vertically. The source is available on github : https://github.com/lindenb/ccsandbox/blob/master/src/verticalize.cpp.
An Example with 1000genomes.org :
$ curl -s &quot;ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20100804/ALL.2of4intersection.20100804.sites.vcf.gz&quot;|\
gunzip -c | grep -v &quot;##&quot; |\
verticalize | head -n 30

&gt;&gt;&gt; (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5295797</comments>
            <pubDate>Wed, 05 Oct 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5295797</guid>        </item>
        <item>
            <title>Surveillance Workshop at GMDS</title>
            <link>http://www.medworm.com/index.php?rid=5272042&amp;cid=d_132_132_f&amp;fid=39192&amp;url=http%3A%2F%2Fmeco-project.blogspot.com%2F2011%2F09%2Fsurveillance-workshop-at-gmds.html</link>
            <description>On September 28, 2011, the workshop &quot;From indicator-based to event-based surveillance&quot; took place in conjunction with the GMDS Jahrestagung in Mainz. We had four interesting presentations. Tim Eckmanns gave an overview on indicator- and event based systems and showed the delays in traditional reporting on the EHEC outbreak in 2011. Jas Mantero from ECDC spoke about the event-based surveillace as performed by ECDC. Jens Linge from JRC presented the MediSys System as an example of an event-based system. Kerstin Denecke (L3S) introduced the M-Eco projet. The audience was very interested in the topic and asked motivating questions. All in all a nice workshop in the sunny city of Mainz. (Source: Medical Ecosystem for Personalized Event-Based Surveillance)</description>
            <author>Medical Ecosystem for Personalized Event-Based Surveillance</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5272042</comments>
            <pubDate>Fri, 30 Sep 2011 06:21:00 +0100</pubDate>
            <guid isPermaLink="false">5272042</guid>        </item>
        <item>
            <title>Thinking Outside the Box or Just Plain Nuts?</title>
            <link>http://www.medworm.com/index.php?rid=5272040&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F09%2Fthinking-outside-box-or-just-plain-nuts.html</link>
            <description>Please take the title in the spirit it is intended: as a bit lighthearted. Seeing the object pictured and reading the accompanying blog post from one of Jonathan Eisen&amp;#39;s graduate students.  It&amp;#39;s an unusual solution to a common problem, and gave me a good chuckle.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5272040</comments>
            <pubDate>Wed, 28 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5272040</guid>        </item>
        <item>
            <title>Boston's Boris Bikes</title>
            <link>http://www.medworm.com/index.php?rid=5272041&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F09%2Fbostons-boris-bikes.html</link>
            <description>When I discovered that my new gig would temporarily in Boston, I realized I had an opportunity to try out Boston&amp;#39;s new bikeshare program. Started this summer, Hubway consists of racks of bikes in public places which can be used for short hops around town. I like my folding bike, but on some rush hour trains it is very hard to find space for it, especially with some train conductors who are more interested in giving dirty looks than serving their passengers. Plus, it&amp;#39;s now quite dark on the last leg of my commute, and even if I had some really slick lights I don&amp;#39;t like riding even short distances in the dark.Read more » (Source: Omics! Omics!)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5272041</comments>
            <pubDate>Tue, 27 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5272041</guid>        </item>
        <item>
            <title>---</title>
            <link>http://www.medworm.com/index.php?rid=5259054&amp;cid=d_132_132_f&amp;fid=39192&amp;url=http%3A%2F%2Fmeco-project.blogspot.com%2F2011%2F09%2Fnelson-award-nominee-hypertext-2011-we.html</link>
            <description>Nelson Award Nominee @ Hypertext 2011 We were pleased to be nominated for the Nelson Award at Hypertext 2011 for our paper entitled, A Transfer Approach to Detecting Disease Reporting Events in Blog Social Media, Avaré Stewart, et.al.We continue our work of tackling the burden of manually labelling data and address the problems associated with building a supervised learner to classify frequently evolving, and variable blog content. We automatically classify outbreak reports to train a supervised learner, and the knowledge acquired from the learning process is then transferred to the task of classifying blogs.For more details about our work, please refer to the following link: http://dl.acm.org/citation.cfm?id=1996001 (Source: Medical Ecosystem for Personalized Event-Based Surveillance)</description>
            <author>Medical Ecosystem for Personalized Event-Based Surveillance</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5259054</comments>
            <pubDate>Tue, 27 Sep 2011 03:19:00 +0100</pubDate>
            <guid isPermaLink="false">5259054</guid>        </item>
        <item>
            <title>PostScript as a Programming Language for Bioinformatics: mynotebook</title>
            <link>http://www.medworm.com/index.php?rid=5259053&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F09%2Fpostscript-as-programming-language-for.html</link>
            <description>&quot;PostScript (PS) is an interpreted, stack-based programming language. It is best known for its use as a page description language in the electronic and desktop publishing areas.&quot;[wikipedia]. In this post, I'll show how I've used to create a simple and lightweight view 
of the genome.




Introduction: just a simple postscript program
The following PS program fills a rectangular gray shape; You (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5259053</comments>
            <pubDate>Mon, 26 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5259053</guid>        </item>
        <item>
            <title>Joining  genomic annotations files with the tabix API.</title>
            <link>http://www.medworm.com/index.php?rid=5248961&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F09%2Fjoining-genomic-annotations-files-with.html</link>
            <description>Tabix is a software that is part of the samtools package.
After indexing a file, tabix is able to quickly retrieve data lines overlapping genomic regions (see also my previous post about tabix). Here, I wrote a tool named jointabix that joins the data of a (chrom/start/end) file with a file indexed with tabix. I've posted the code on github at: https://github.com/lindenb/samtools-utilities/blob/ (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5248961</comments>
            <pubDate>Fri, 23 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5248961</guid>        </item>
        <item>
            <title>Transitions</title>
            <link>http://www.medworm.com/index.php?rid=5248960&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F09%2Ftransitions.html</link>
            <description>I went to an Infinity going-away lunch last week. We head off to some favorite local restaurant and order a modest (but delicious) meal on the company dime. The departee makes an impromptu speech, there are goodbyes and handshakes and usually a number of pleas to stay, both fictitious and heartfelt. Those staying wonder what could lure someone away from the very safe and green pastures of the company. I&amp;#39;ve been to many such lunches with Millennium and Infinity; with Codon the lunches tended to be group affairs as people were laid off in batches. Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5248960</comments>
            <pubDate>Sun, 18 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5248960</guid>        </item>
        <item>
            <title>Illumina Calls for a Flag on the Play</title>
            <link>http://www.medworm.com/index.php?rid=5222726&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F09%2Fillumina-calls-for-flag-on-play.html</link>
            <description>Continuing my sports analogies, but switching games, in my coverage of the benchtop sequencer war today.  Alas, I can&amp;#39;t refer to instant replay, as the usual set of procrastination excuses has resulted in this being filed very late after I was made aware of it (first by a comment in the blog, then by a friendly chap from Illumina alerting me).  In any case, Illumina has responded to Ion Torrent&amp;#39;s claims on long reads and overestimated MiSeq quality, and mostly done so by crying &amp;quot;Foul!&amp;quot;.Read more » (Source: Omics! Omics!)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5222726</comments>
            <pubDate>Wed, 14 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5222726</guid>        </item>
        <item>
            <title>The Wikipedia Template:Infobox_biodatabase is now integrated in DBPedia</title>
            <link>http://www.medworm.com/index.php?rid=5206757&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F09%2Fwikipedia-templateinfoboxbiodatabase-is.html</link>
            <description>In January 2011, I started the project Template:Infobox_biodatabase. The goal of this project is the annotation of the biological databases in wikipedia using an infobox. The pages annotated with this template have now been integrated into DBpedia 3.7 and it is now possible to query the data through a SPARQL endpoint.
(Note: during the process of writing the new pages in wikipedia, a few articles (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5206757</comments>
            <pubDate>Sun, 11 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5206757</guid>        </item>
        <item>
            <title>Customizing &quot;C printf&quot; to print a FASTA sequence.</title>
            <link>http://www.medworm.com/index.php?rid=5206758&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F09%2Fcustomizing-c-printf-to-print-fasta.html</link>
            <description>The GNU C library lets you define your own custom conversion specifiers
for printf template strings. In the current post, I will show how to add a new modifier for printf to print a fasta sequence. Say we have a C structure holding a FASTA sequence:
typedef struct fasta_t
	{
	char* name;
	char* sequence;
	}Fasta;
The new output conversion is registered with register_printf_specifier. (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5206758</comments>
            <pubDate>Fri, 09 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5206758</guid>        </item>
        <item>
            <title>Interacting with bioinformatics webservers using R</title>
            <link>http://www.medworm.com/index.php?rid=5206756&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F09%2F08%2Finteracting-with-bioinformatics-webservers-using-r%2F</link>
            <description>In an ideal world, all bioinformatics tools would be made available via the Web as a web service with an API, as well as a standalone package to download for local use. This is rarely the case and sometimes, even where one or the other is available, factors such as cost come into play. So we resort to web scraping; writing code to interact with the code that lies behind a web server so as to submit queries, retrieve and parse results.
Normally, I&amp;#8217;d use something like Ruby&amp;#8217;s Mechanize library for this purpose. However, where the purpose is to retrieve delimited data for analysis using R, I figured it was time to try and achieve the entire process within R. So here&amp;#8217;s how I used the RCurl and XML packages to interact with the WHAT IF server, which provides tools for the anal...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5206756</comments>
            <pubDate>Thu, 08 Sep 2011 12:49:42 +0100</pubDate>
            <guid isPermaLink="false">5206756</guid>        </item>
        <item>
            <title>Parsing a BAM file with javascript, yes we can. (Node.js and V8)</title>
            <link>http://www.medworm.com/index.php?rid=5206759&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F09%2Fparsing-bam-with-javascript-yes-we-can.html</link>
            <description>Node.js is an event-driven I/O server-side JavaScript environment based on V8, Google's open source JavaScript engine. In the current post I will describe how I've used Node/V8 to parse a BAM file. Here I've used node v0.5.5 and my code is hosted in a new git repository bionode.


Designing a C++ native extension for Node.js wrapping the bgzf format
BAM files are stored using the bgzf format. We (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5206759</comments>
            <pubDate>Tue, 06 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5206759</guid>        </item>
        <item>
            <title>Merging two XML files using the libxml streaming API.</title>
            <link>http://www.medworm.com/index.php?rid=5206760&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F09%2Fmerging-two-xml-files-using-libxml.html</link>
            <description>I just wrote a program appending one or more XML files to another XML file using the libxml streaming API. This program named mergexml is available on github at: mergexml.Usage$ mergexml -hmergexml: Pierre Lindenbaum PHD. 2011.Compilation: Sep 6 2011 at 19:54:22.Usage: mergexml [options] -i database.xml xml1, xml2, ....xmlnOptions: -i (required) -o (default:stdout) --replace replace xml-in (Source: YOKOFAKUN)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5206760</comments>
            <pubDate>Tue, 06 Sep 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5206760</guid>        </item>
        <item>
            <title>Genetic Education:</title>
            <link>http://www.medworm.com/index.php?rid=5182169&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F09%2Fgenetic-education.html</link>
            <description>   A study by the American Society of Human Genetics (ASHG) is published today suggesting that few U.S. states have adequate standards in place for genetics education.  I forgot to ask for a link to the article that would go live post-embargo, but it should be on the website of the journal (CBE Life Sciences Education)Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5182169</comments>
            <pubDate>Wed, 31 Aug 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5182169</guid>        </item>
        <item>
            <title>Will Max-Seq Gain Traction?</title>
            <link>http://www.medworm.com/index.php?rid=5182170&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F08%2Fwill-max-seq-gain-traction.html</link>
            <description>      At the beginning of the month, I had dinner with J Adams from Azco Biotech and some friends/colleagues of his and talked over the Max-Seq.   And yes, I did let them pick up the bill -- J wouldn&amp;#39;t let me pay for myself.  In Sequence did a nice piece on it the next week, so again I&amp;#39;ve blown an opportunity to scoop them.  Then somehow, between vacations and other distractions, this piece was stuck in blogger limbo.  But, there are some details I don&amp;#39;t see there and some color I think worth adding.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5182170</comments>
            <pubDate>Tue, 30 Aug 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5182170</guid>        </item>
        <item>
            <title>Wishing I Had Been A Referee: A Renaissance for Tagamet?</title>
            <link>http://www.medworm.com/index.php?rid=5174807&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F08%2Fwishing-i-had-been-referee-renaissance.html</link>
            <description>Back from vacation &amp; watching another wave of Hurricane Irene soak the area (curiously, the windiest times so far seem to be breaks in the rain).  August has not seen much attention paid to this space (indeed, I have one piece that has gestated nearly the whole month), so time to put the shoulder to the wheel. Just before my vacation, a pair of papers showed up in Science Translational Medicine which describe two attempts at drug re-positioning by transcriptional profiling.  The key concept is to take expression profiles for diseases and try to find drugs which appear to generate the opposite transcriptional pattern, with the theory that the drug could nudge the disease pattern back to a normal state.  This is an idea which has been kicking around for a while, and at one time was...</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5174807</comments>
            <pubDate>Sat, 27 Aug 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5174807</guid>        </item>
        <item>
            <title>Popular topics at the BioStar Q&amp;A site</title>
            <link>http://www.medworm.com/index.php?rid=5159569&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F08%2F23%2Fpopular-topics-at-the-biostar-qa-site%2F</link>
            <description>Conclusions? XML, ggplot2 and wordcloud are all great packages. And whilst so-called &amp;#8220;next-generation-sequencing&amp;#8221; might be all the rage, it&amp;#8217;s good to see the old stalwarts of bioinformatics hanging in there: BLAST, alignment, phylogenetics, Python and Perl. It will be interesting to see how tags change over time.
Filed under: bioinformatics, R, statistics, web resources Tagged: biostar, stackexchange (Source: What You're Doing Is Rather Desperate)</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5159569</comments>
            <pubDate>Tue, 23 Aug 2011 06:54:14 +0100</pubDate>
            <guid isPermaLink="false">5159569</guid>        </item>
        <item>
            <title>Monitoring PubMed retractions: updates</title>
            <link>http://www.medworm.com/index.php?rid=5130966&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F08%2F16%2Fmonitoring-pubmed-retractions-updates%2F</link>
            <description>PubMed cumulative retractions 1977-presentThere&amp;#8217;s been a recent flurry of interest in retractions. See for example: Scientific Retractions: A Growth Industry?; summarised also by GenomeWeb in Take That Back; articles in the WSJ and the Pharmalot blog; and academic articles in the Journal of Medical Ethics and Infection &amp; Immunity.
Several of these sources cite data from my humble web application, PMRetract. So now seems like a good time to mention that:

The application is still going strong and is updated regularly
I&amp;#8217;ve added a few enhancements to the UI; you can follow development at GitHub
I&amp;#8217;ve also added a long-overdue about page with some extra information, including the fact that I wrote it :)

Now I just need to fix up my Git repositories. Currently there&amp;#8217...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5130966</comments>
            <pubDate>Tue, 16 Aug 2011 05:51:03 +0100</pubDate>
            <guid isPermaLink="false">5130966</guid>        </item>
        <item>
            <title>A FUSE-based filesystem reproducing the NCBI Taxonomy hierarchy.</title>
            <link>http://www.medworm.com/index.php?rid=5125901&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F08%2Ffuse-based-filesyetem-reproducing-ncbi.html</link>
            <description>In this post I will show how I've used FUSE to create a new loadable filesystem for Linux, reproducing the tree of the NCBI Taxonomy.

From the Fuse Homepage:With FUSE it is possible to implement a fully functional filesystem in a userspace program. Features include:Simple library APISimple installation (no need to patch or recompile the kernel)Secure implementationUserspace - kernel interface (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5125901</comments>
            <pubDate>Fri, 12 Aug 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5125901</guid>        </item>
        <item>
            <title>Wikipedia OpenSearch and semantic bash completion</title>
            <link>http://www.medworm.com/index.php?rid=5118892&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F08%2Fwikipedia-opensearch-and-semantic-bash.html</link>
            <description>I wrote a tool that use the opensearch API for mediawiki to display the results of a query in wikipedia:$ opensearch Charles DarwCharles_Darwin  Charles Robert Darwin FRS (12 February 1809 – 19 April 1882) was an English...Charles_Darwin_University  Charles Darwin University (CDU) is an Australian public university wi...Charles_Darwin%27s_health  For much of his adult life, Charles Darwin's (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5118892</comments>
            <pubDate>Tue, 09 Aug 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5118892</guid>        </item>
        <item>
            <title>Quick tip: bash completion for Bioinformatics</title>
            <link>http://www.medworm.com/index.php?rid=5118893&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F08%2Fquick-tip-bash-completion-for.html</link>
            <description>The default behavior for the completion can be extended by creating a new file:
${HOME}/.bash_completion
in your home.
For example, you can write your own Bash Completion for samtools, bwa or your favourite tool by adding the following line in &quot;${HOME}/.bash_completion&quot;:
complete -f -X '!*.@(bam|sam|fasta|fa|fa.gz|fastq.gz|fasta.gz)' samtools bwa

Open a new bash and type:
$ samtools index (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5118893</comments>
            <pubDate>Mon, 08 Aug 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5118893</guid>        </item>
        <item>
            <title>Names in Collision</title>
            <link>http://www.medworm.com/index.php?rid=5107801&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F08%2Fnames-in-collision.html</link>
            <description>I will claim that I saw this coming, in that I&amp;#39;ve toyed with the basic skeleton of this post before.  But, I hadn&amp;#39;t gotten around to it -- but how could I miss this opportunity.  On a mailing list devoted to SAM, someone asked about a related topic to SAM, and one of the experts on the board replied with an electronic head-scratchingRead more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5107801</comments>
            <pubDate>Sun, 07 Aug 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5107801</guid>        </item>
        <item>
            <title>BioRuby development: feedback on using Git</title>
            <link>http://www.medworm.com/index.php?rid=5096848&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F08%2F05%2Fbioruby-development-feedback-on-using-git%2F</link>
            <description>Everyone likes constructive feedback. I received a couple of great comments on my previous post, which warrant a brief discussion.
@vlandham points out that when the main BioRuby repository updates, you&amp;#8217;ll want to update your local repository. Using git, you do that by adding a remote which points to the original repository, from which you can fetch updates and merge with your local version:

git remote add upstream https://github.com/bioruby/bioruby.git
# fetch/merge only when main repo updates
git fetch upstream
git merge upstream master

This is described at the GitHub help page Fork A Repo.
Michael points to an article titled A successful Git branching model. It suggests that when developing new features you create a feature branch (also called topic branch). This can help with t...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5096848</comments>
            <pubDate>Fri, 05 Aug 2011 03:15:57 +0100</pubDate>
            <guid isPermaLink="false">5096848</guid>        </item>
        <item>
            <title>A beginner’s guide to BioRuby development</title>
            <link>http://www.medworm.com/index.php?rid=5096849&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F08%2F04%2Fa-beginners-guide-to-bioruby-development%2F</link>
            <description>I&amp;#8217;m the &amp;#8220;biologist-turned-programmer&amp;#8221; type of bioinformatician which makes me a hacker, not a developer. Most of the day-to-day coding that I do goes something like this:
Colleague: Hey Neil, can you write me a script to read data from file X, do Y to it and output a table in file Z?
Me: Sure&amp;#8230; (clickety-click, hackety-hack&amp;#8230;) &amp;#8230;there you go.
Colleague: Great! Thanks.
I&amp;#8217;m a big fan of the Bio* projects and have used them for many years, beginning with Bioperl and more recently, BioRuby. And I&amp;#8217;ve always wanted to contribute some code to them, but have never got around to doing so. This week, two thoughts popped into my head:

How hard can it be?
There isn&amp;#8217;t much introductory documentation for would-be Bio* developers

The answer to the firs...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5096849</comments>
            <pubDate>Thu, 04 Aug 2011 13:31:35 +0100</pubDate>
            <guid isPermaLink="false">5096849</guid>        </item>
        <item>
            <title>Ion Throws A Long Punch At MiSeq</title>
            <link>http://www.medworm.com/index.php?rid=5096847&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F08%2Fion-throws-long-punch-at-miseq.html</link>
            <description>The benchtop sequencer wars are heating up!  Illumina and Life are engaged in a fierce war of pamphlets and datasets to convince the world that they have the edge.  I won&amp;#39;t attempt to give a complete play-by-play, but hit on the latest developments, which includes Ion releasing a dataset of 250+ bp reads.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5096847</comments>
            <pubDate>Wed, 03 Aug 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5096847</guid>        </item>
        <item>
            <title>The GPCR-G complex: the canonical structure of Cell Signaling (with interactive guide)</title>
            <link>http://www.medworm.com/index.php?rid=5086452&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fprotein%2Fthe-gpcr-g-complex-the-canonical-structure-of-cell-signaling</link>
            <description>embed_a1_dict = {
 &quot;pdb_id&quot;:&quot;3sn6&quot;,
 &quot;canvas_tag&quot;:&quot;embed_a1_tag&quot;,
 &quot;view_id&quot;:&quot;view:u74775&quot;,
 &quot;text_height&quot;:120,
}
embed_a2_dict = {
 &quot;pdb_id&quot;:&quot;3sn6&quot;,
 &quot;canvas_tag&quot;:&quot;embed_a2_tag&quot;,
 &quot;view_id&quot;:&quot;view:5xd1fr&quot;,
 &quot;text_height&quot;:80,
}
embed_a3_dict = {
 &quot;pdb_id&quot;:&quot;1azt&quot;,
 &quot;canvas_tag&quot;:&quot;embed_a3_tag&quot;,
 &quot;view_id&quot;:&quot;view:nid9qp&quot;,
 &quot;text_height&quot;:80,
}


	Perhaps you&amp;#8217;ve heard of the new structure of the GPCR-G complex. It&amp;#8217;s a seminal breakthrough, and I concur with Curious Wavefunction in that it&amp;#8217;s worthy of a Nobel prize. 

	So noobs, why is it important? According to some estimates, some 30% of our proteins are membrane proteins and the reason that they are so many is that membrane proteins are where most of the signal processors between our cells are found. 

	Our multi-cellularity d...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5086452</comments>
            <pubDate>Mon, 01 Aug 2011 23:27:19 +0100</pubDate>
            <guid isPermaLink="false">5086452</guid>        </item>
        <item>
            <title>ISMB coverage on Twitter? It’s possible there was…</title>
            <link>http://www.medworm.com/index.php?rid=5086448&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F08%2F01%2Fismb-coverage-on-twitter-its-possible-there-was%2F</link>
            <description>Peter writes:

I wonder if part of the drop off is live bloggers moving to platforms like Twitter? I can tell you it seemed like there were almost as many tweets for one SIG (#bosc2011) as for the whole of #ISMB / #ECCB2011, and I personally didn’t post anything to FriendFeed but posted lots on Twitter.

Well, there&amp;#8217;s a problem with using Twitter for analysis of conference coverage. Let&amp;#8217;s try searching for ISMB-related tweets using the twitteR package:

library(twitteR)
ismb &amp;lt;- searchTwitter(&amp;quot;ismb&amp;quot;, 1000)
length(ismb)
# [1] 30

If we can&amp;#039;t archive, how can anyone else?30? Are we using twitteR properly? Running the same search at the Twitter website gives roughly the same results, plus this unhelpful message.
I like Twitter &amp;#8211; as a real-time communicatio...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5086448</comments>
            <pubDate>Mon, 01 Aug 2011 04:38:17 +0100</pubDate>
            <guid isPermaLink="false">5086448</guid>        </item>
        <item>
            <title>The Metaphysical Countrygirl</title>
            <link>http://www.medworm.com/index.php?rid=5086453&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fbooks%2Fthe-metaphysical-countrygirl</link>
            <description>You, functional space
variants in voltage, the only light
Transitory effect of Love
several different lights
Sustain
Sustain them
you sustain them.

	&amp;nbsp;&amp;nbsp; ~ Omar Pérez, 2010
&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; translated by Krisitin Dykstra

	

Gentlemen: a little gift for the ladies who prefer Tesla coils to hair-rollers (Source: Trapped in the USA)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5086453</comments>
            <pubDate>Mon, 01 Aug 2011 02:57:44 +0100</pubDate>
            <guid isPermaLink="false">5086453</guid>        </item>
        <item>
            <title>Using the  Freebase and the Bioportal Widgets to create a semantic object.</title>
            <link>http://www.medworm.com/index.php?rid=5086449&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F08%2Fusing-freebase-and-bioportal-widgets-to.html</link>
            <description>The following HTML code uses:a Widget from the NCBO Bioportal for the Protein-protein interaction OntologyAnd two Widgets from Freebase, linking to the proteins, the genes and the diseases.After completion, it generates a JSON object describing a semantic object:{&quot;subject&quot;:&quot;http://www.freebase.com/view/en/nsp3&quot;,&quot;predicate&quot;:&quot;http://purl.org/obo/owl/MI#MI_0407&quot;,&quot;value&quot;:&quot;http://www.freebase.com/view (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5086449</comments>
            <pubDate>Sun, 31 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5086449</guid>        </item>
        <item>
            <title>Memory-Mapping the Human Genome with 'mmap': my notebook</title>
            <link>http://www.medworm.com/index.php?rid=5086450&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F08%2Fmemory-mapping-human-genome-with-mmap.html</link>
            <description>In this post, I've explored how to use a memory-mapped file to handle the 3Go of the Human Genome sequence as if it was entirely loaded in memory.Via wikipedia: &quot;A memory-mapped file is a segment of virtual memory which has been assigned a direct byte-for-byte correlation with some portion of a file or file-like resource. This resource is typically a file that is physically present on-disk... In (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5086450</comments>
            <pubDate>Sun, 31 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5086450</guid>        </item>
        <item>
            <title>Storing some SNPs using the leveldb library. My notebook.</title>
            <link>http://www.medworm.com/index.php?rid=5086451&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Fstoring-some-snps-using-leveldb-library.html</link>
            <description>In this post I'll describe how I've used the leveldb API, a C++ key-value database, to store some SNPs (key=rs,value=sequence). &quot;LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.&quot;. A benchmark for this engine is available here: http://code.google.com/p/leveldb/source/browse/trunk/doc/benchmark.html.Download &amp; install (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5086451</comments>
            <pubDate>Sat, 30 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5086451</guid>        </item>
        <item>
            <title>How Many Toes Does Ion Have Left?</title>
            <link>http://www.medworm.com/index.php?rid=5077938&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F07%2Fhow-many-toes-does-ion-have-left.html</link>
            <description>I&amp;#39;ve previously complained about Ion Torrent&amp;#39;s bungling and secrecy when it comes to educating their current and potential future users about key technical information.  I&amp;#39;ve recently come across yet another botch, one that underscores that attempting to control information in the Internet age only serves to distract from the goal of ensuring access to correct information.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5077938</comments>
            <pubDate>Thu, 28 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5077938</guid>        </item>
        <item>
            <title>I can’t resist a word cloud: now using R!</title>
            <link>http://www.medworm.com/index.php?rid=5077940&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F07%2F28%2Fi-cant-resist-a-word-cloud-now-using-r%2F</link>
            <description>Top 1000 words in FriendFeed comments, ISMB 2008-2011The wordcloud package is word clouds for R with a difference: they look great.
Of course, having just analysed online coverage of the ISMB conference, I had to run all 6 906 comments from the 2008-2011 meetings through some code. If you followed along via the Sweave code, I went as far as generating the data frame of comments, ismb.comments, then pulled the comment text into a new data frame using:
data.frame(ismb.comments$body)
It was then simply a case of following along with the excellent example code from the post Word Cloud in R, over at One R Tip A Day, limiting myself to the 1000 most-used words. Watch out, the TermDocumentMatrix() function from the tm package uses quite a lot of memory.
Result shown at right: click image for full...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5077940</comments>
            <pubDate>Thu, 28 Jul 2011 10:11:36 +0100</pubDate>
            <guid isPermaLink="false">5077940</guid>        </item>
        <item>
            <title>Analysis of ISMB coverage at FriendFeed: 2008 – 2011</title>
            <link>http://www.medworm.com/index.php?rid=5077941&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F07%2F28%2Fanalysis-of-ismb-coverage-at-friendfeed-2008-2011%2F</link>
            <description>ISMB/ECCB 2011 was held between July 15-19 this year and as in previous years, FriendFeed was used to cover the meeting.
Last year, I wrote a post about how to use R to analyse the coverage. I was planning something similar for 2011 when I thought: we have 4 years of ISMB at FriendFeed now &amp;#8211; why not look at all of them?
So I did. Read on for the details.




   1. First &amp;#8211; an apology
In my post from last year, I included some R code which will grab a FriendFeed feed in JSON format using the API and convert it to an R list. I would have used it again this year, except that the ISMB 2008 feed is no longer a complete archive of the 2008 meeting. This is, unfortunately, my fault &amp;#8211; but all is not lost.
I decided to leave FriendFeed earlier this year and began by deleting the se...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5077941</comments>
            <pubDate>Thu, 28 Jul 2011 02:40:17 +0100</pubDate>
            <guid isPermaLink="false">5077941</guid>        </item>
        <item>
            <title>Tschuess, Shuttle!</title>
            <link>http://www.medworm.com/index.php?rid=5077939&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F07%2Ftschuess-shuttle.html</link>
            <description>Clearly with a week having passed since touchdown, my farewell to the space shuttle program is grossly overdue.  I actually missed watching the event live, as I was off on a camping trip in some beautiful New Hampshire woods.   Atop of the usual excuses, it&amp;#39;s been a bit hard to figure out what I could write that was different than so much that has been written.  It would seem that every major media outlet opined on the subject, and most did a decent job in hitting the key points. Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5077939</comments>
            <pubDate>Wed, 27 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5077939</guid>        </item>
        <item>
            <title>Controlling IGV through a Port: my notebook.</title>
            <link>http://www.medworm.com/index.php?rid=5069714&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Fcontrolling-igv-through-port-my.html</link>
            <description>A protocol for interacting with the The Integrative Genomics Viewer (IGV) is described here: http://www.broadinstitute.org/igv/PortCommands. So, you can save some pictures of a genomic region without interacting with IGV.Via a shell script$ cat script.shmkdir -p /tmp/igvsleep 2echo &quot;snapshotDirectory /tmp/igv&quot;sleep 2echo &quot;goto chr3:100&quot;sleep 2echo &quot;snapshot&quot;sleep 2echo &quot;goto chr3:200-300&quot;sleep (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5069714</comments>
            <pubDate>Tue, 26 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5069714</guid>        </item>
        <item>
            <title>A mysql full-text parser searching for some SNPs</title>
            <link>http://www.medworm.com/index.php?rid=5069715&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Fmysql-full-text-parser-searching-for.html</link>
            <description>A mysql full-text parser server plugin can be used to replace or modify the built-in full-text parser.This post a full-text parser plugin named bioparser that extracts the rs#### id from a text.The sourceThe source of this plugin is available on github at:https://github.com/lindenb/bioudf/blob/master/src/bioparser.c.The work horse of the plugin is a simple function bioparser_parse scanning the (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5069715</comments>
            <pubDate>Mon, 25 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5069715</guid>        </item>
        <item>
            <title>The samtools C API: creating a WIG file for the genome coverage</title>
            <link>http://www.medworm.com/index.php?rid=5062425&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Fsamtools-c-api-creating-wig-file-for.html</link>
            <description>I wrote a tool which uses the samtools API to generate a WIG file for the genome coverage. This tool, bam2wig, as well as some other tools using this library are available in the following git repository:https://github.com/lindenb/samtools-utilitiesMy code is based on this original C source hosted on: http://samtools.sourceforge.net/sam-exam.shtml. (I hope I understood this API as many functions/ (Source: YOKOFAKUN)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5062425</comments>
            <pubDate>Sun, 24 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5062425</guid>        </item>
        <item>
            <title>Getting out gene list from Ensembl web query results with WWW::Mechanize</title>
            <link>http://www.medworm.com/index.php?rid=5571392&amp;cid=d_132_132_f&amp;fid=35036&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fblogspot%2FLuPcC%2F%7E3%2FeqMujKKaaew%2Fgetting-out-gene-list-from-ensembl-web.html</link>
            <description>Seems like the Ensembl API is not providing a way to get list of results (?) for web query, thus decided to write a simple perl script. So to get list of gene hits for Ensembl query 'histone h2a histone cluster 1' one can simply run:
perl getensidmech.pl 'histone h2a histone cluster 1'and the quick and dirty code&amp;nbsp; getensidmech.pl using WWW::Mechanize is there at http://research-code-base-animesh.googlecode.com/svn/trunk/code_lic/getensidmech.pl :) (Source: Bioinformatics Latest News)</description>
            <author>Bioinformatics Latest News</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5571392</comments>
            <pubDate>Wed, 20 Jul 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5571392</guid>        </item>
        <item>
            <title>Drawing a SVG timeline with the  http://data.bnf.fr data</title>
            <link>http://www.medworm.com/index.php?rid=5051028&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Fdrawing-svg-timeline-with-httpdatabnffr.html</link>
            <description>The National French Library has recently started to releaseits data as RDF/XML. Here, I've played with the biographies of the famous French writers to create a simple timeline.Unfortunately, this timeline is too large to be displayed here. :-)However, here is the java code I used to generate this map as a SVG document: See also: Freebase and the History of Sciences That's it,Pierre (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5051028</comments>
            <pubDate>Tue, 19 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5051028</guid>        </item>
        <item>
            <title>Intuition arises out of Mathematical Intractability</title>
            <link>http://www.medworm.com/index.php?rid=5051030&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fscience%2Fintuition-arises-out-of-mathematical-intractability</link>
            <description>In conclusion, all I want to add to Curious&amp;#8217;s post is that chemist rely on their intuition specifically because the mathematics in quantum chemistry does not admit easily solvable equations, or reliable low-order approximations. The intuition is needed simply because a useful mathematical solution is not tractable. Similar problems plague many sub-disciplines in physics but the physicists have a much better PR machine. (Source: Trapped in the USA)</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5051030</comments>
            <pubDate>Mon, 18 Jul 2011 13:51:53 +0100</pubDate>
            <guid isPermaLink="false">5051030</guid>        </item>
        <item>
            <title>An interactive 'dialog' generating a SAM Flag</title>
            <link>http://www.medworm.com/index.php?rid=5051029&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Finteractive-dialog-generating-sam-flag.html</link>
            <description>The following shell script uses the dialog utility to generate a numeric Sam flag.$ sh selectsam.sh    ┌───────────────────────SAM FLAGS──────────────────────────┐   │ SELECT FLAGS                       │    │ ┌──────────────────────────────────────────────────────┐ │    │ │ [X] 1   Read Paired                │ │    │ │ [ ] 2   Read (Source: YOKOFAKUN)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5051029</comments>
            <pubDate>Sun, 17 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5051029</guid>        </item>
        <item>
            <title>I give up</title>
            <link>http://www.medworm.com/index.php?rid=5028877&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F07%2F15%2Fi-give-up%2F</link>
            <description>It&amp;#8217;s what &amp;#8211; 10 years or more? &amp;#8211; since we began to wonder when web technologies such as RSS, wikis and social bookmarking sites would be widely adopted by most working scientists, to further their productivity.
The email that I received today which began &amp;#8220;I&amp;#8217;ve read 3 interesting papers&amp;#8221; and included 1 .doc, 3 .docx and 4 .pdf files as attachments is indicative of the answer to this question, which is &amp;#8220;not any time soon.&amp;#8221;
I&amp;#8217;ve given up trying to educate colleagues in best practices. Clearly, I&amp;#8217;m the one with the problem, since this is completely normal, acceptable behaviour for practically everyone that I&amp;#8217;ve ever worked with. Instead, I&amp;#8217;m just waiting for them to retire (or die). I reckon most senior scientists (and they...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5028877</comments>
            <pubDate>Fri, 15 Jul 2011 04:26:35 +0100</pubDate>
            <guid isPermaLink="false">5028877</guid>        </item>
        <item>
            <title>I can't save you, but I can help you: reverse-complementing a DNA in OpenOffice</title>
            <link>http://www.medworm.com/index.php?rid=5028878&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Fi-cant-save-you-but-i-can-help-you.html</link>
            <description>People in my lab are using &quot;Microsoft Word&quot; to edit and annotate their sequences. Just like Neil, I can't save them but I can help them: here is an OpenOffice / LibreOffice macro reverse-complementing the current selected text. open libreoffice/openofficeOpen menu &quot;Tools / Macros / Organize macros / Basic...Select &quot;My Macros/Standard&quot; and click on &quot;New&quot;Copy+paste my macro hereClick on the top (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5028878</comments>
            <pubDate>Thu, 14 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5028878</guid>        </item>
        <item>
            <title>A text alignment viewer using the samtools API</title>
            <link>http://www.medworm.com/index.php?rid=5028879&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Ftext-alignment-viewer-using-samtools.html</link>
            <description>I wrote a modified version of samtools tview. The original code uses the curses API to display an interactive sequence viewer.I wanted to generate this kind of screen for a large number of positions and I wanted to be able to redirect the output to a unix pipeline so I changed the original code in order to only handle an extensible 2-dimensional array of characters. The code I wrote is available (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5028879</comments>
            <pubDate>Wed, 13 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5028879</guid>        </item>
        <item>
            <title>Ion's Gaggle of New Accessory Products</title>
            <link>http://www.medworm.com/index.php?rid=5028875&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F07%2Fions-gaggle-of-new-accessory-products.html</link>
            <description>     Ion Torrent had a press release announcing a number of new components to the platform, as well as shipping of the 316 chip and a drop in price of the original 314 chip to $99 (alas, the announcement was not synchronized with updating the online store, so no details could be checked -- such as the price of the 316 chips).Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5028875</comments>
            <pubDate>Mon, 11 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5028875</guid>        </item>
        <item>
            <title>PacBio's Foray Against the German E.coli Outbreak</title>
            <link>http://www.medworm.com/index.php?rid=5028876&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F07%2Fpacbios-foray-against-german-ecoli.html</link>
            <description>&amp;nbsp; &amp;nbsp; I recently remarked that it was surprising that PacBio had not jumped on the German E.coli outbreak strain, given CEO Eric Schadt's professed interest in biosurveillance. &amp;nbsp;Dr. Schadt was kind enough to spend a half hour discussing the topic with me last week, in the wake of PacBio releasing de novo assemblies of this strain and 11 other pathogenic E.coli strains (6 having never previously been sequenced) on the PacBio DevNet website. Dr. Schadt also has a detailed blog post on the project.&amp;nbsp; &amp;nbsp; In both the blog post and our conversation yesterday he explained why PacBio wasn't in the forefront of this effort but still decided to jump in. Dr. Schadt was certainly aware of the outbreak; he was in Germany at a conference when the public alarm was building. But initi...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5028876</comments>
            <pubDate>Sun, 10 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5028876</guid>        </item>
        <item>
            <title>Storing SNPs in a HDF5 file: my notebook</title>
            <link>http://www.medworm.com/index.php?rid=5028880&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Fstoring-snps-in-hdf5-file-my-notebook.html</link>
            <description>I discovered the benefits of using twitter in 2008, the day Deepak Singh replied to one of my tweets related to the storage of a large number of genotypes.Since that day, I've tried to use the HDF5 library, without any success (there's a large disheartening documentation/API on the HDF5 site and the API seems to be only used by a small number of geeks). Furthermore, HDF5 is a technology used by (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5028880</comments>
            <pubDate>Fri, 08 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">5028880</guid>        </item>
        <item>
            <title>R: calculations involving months</title>
            <link>http://www.medworm.com/index.php?rid=5008521&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F07%2F07%2Fr-calculations-involving-months%2F</link>
            <description>Ask anyone how much time has elapsed since September last year and they&amp;#8217;ll probably start counting on their fingers: &amp;#8220;October, November&amp;#8230;&amp;#8221; and tell you &amp;#8220;just over 9 months.&amp;#8221;
So, when faced as I was today with a data frame (named dates) like this:

pmid1    year1  month1   pmid2   year2  month2
21355427  2010   Dec    21542215  2011   Mar
21323727  2011   Feb    21521365  2011   Jun
21297532  2011   Feb    21336080  2011   Mar
21291296  2011   Apr    21591868  2011   Jun
...

How to add a 7th column, with the number of months between &amp;#8220;year1/month1&amp;#8243; and &amp;#8220;year2/month2&amp;#8243;?

R has lots of methods for date/time calculations, many of which seem notoriously difficult to get your head around. One answer for my problem came, as so often, from ...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5008521</comments>
            <pubDate>Thu, 07 Jul 2011 13:32:38 +0100</pubDate>
            <guid isPermaLink="false">5008521</guid>        </item>
        <item>
            <title>Second M-Eco User Workshop</title>
            <link>http://www.medworm.com/index.php?rid=4997747&amp;cid=d_132_132_f&amp;fid=39192&amp;url=http%3A%2F%2Fmeco-project.blogspot.com%2F2011%2F07%2Fsecond-m-eco-user-workshop.html</link>
            <description>On May 26, the second M-Eco user workshop took place. Again, representatives of WHO, INVS, ECDC as well as from the Mekong Basin Disease Surveillance Oragnisation attended the meeting. he consortium presented the first prototype and several evaluation experiments have been performed. The user feedback will be considered in the secon phase of the project. (Source: Medical Ecosystem for Personalized Event-Based Surveillance)</description>
            <author>Medical Ecosystem for Personalized Event-Based Surveillance</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4997747</comments>
            <pubDate>Mon, 04 Jul 2011 08:40:00 +0100</pubDate>
            <guid isPermaLink="false">4997747</guid>        </item>
        <item>
            <title>The Neo4j REST API. My notebook</title>
            <link>http://www.medworm.com/index.php?rid=4997744&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F07%2Fneo4j-rest-api-my-notebook.html</link>
            <description>Neo4j is a open-source graph engine implemented in Java. This post is my notebook for the Neo4J-server, a server combining a REST API and a webadmin application into a single stand-alone server.Download &amp; extract Neo4JDownload Neo4J from here$ tar xfz neo4j-community-1.4.M04-unix.tar.gz$ rm neo4j-community-1.4.M04-unix.tar.gz$ cd neo4j-community-1.4.M04/Start the Neo4J Server#edit the config (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4997744</comments>
            <pubDate>Sun, 03 Jul 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4997744</guid>        </item>
        <item>
            <title>Workshop at GMDS conference</title>
            <link>http://www.medworm.com/index.php?rid=4976122&amp;cid=d_132_132_f&amp;fid=39192&amp;url=http%3A%2F%2Fmeco-project.blogspot.com%2F2011%2F06%2Fworkshop-at-gmds-conference.html</link>
            <description>The M-Eco consortium will held a workshop on the topic &quot;From indicator-based to event-based surveillance&quot; at the Annual Meeting of the German Sociaty of Medical Informatics and Society of Epidemiology. The workshop will be held on September 28 at the Johannes-Gutenberg Universität in Mainz. More details are available:here (Source: Medical Ecosystem for Personalized Event-Based Surveillance)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Medical Ecosystem for Personalized Event-Based Surveillance</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4976122</comments>
            <pubDate>Tue, 28 Jun 2011 07:24:00 +0100</pubDate>
            <guid isPermaLink="false">4976122</guid>        </item>
        <item>
            <title>The java library for BigBed and BigWig: my notebook</title>
            <link>http://www.medworm.com/index.php?rid=4976120&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fjava-library-for-bigbed-and-bigwig-my.html</link>
            <description>Jim Robinson and his team, from the Broad Institute/IGV, have recently released a java library parsing the BigBed and the BigWig formats. Here is my notebook for this API.Download and compile the libraryThe sources are hosted at: http://bigwig.googlecode.com/.$ svn checkout http://bigwig.googlecode.com/svn/trunk/ bigwig-read-only$ cd bigwig-read-only$ antBuildfile: build.xmlcompile:  [mkdir] (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4976120</comments>
            <pubDate>Mon, 27 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4976120</guid>        </item>
        <item>
            <title>Bioinformatician 2.0 ( JeBiF Workshop 2011 )</title>
            <link>http://www.medworm.com/index.php?rid=4976121&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fbioinformatician-20-jebif-workshop-2011.html</link>
            <description>Here is the presentation I gave for the JeBiF Workshop 2011 (jobs and careers in bioinformatics).Bioinformatician 2.0 View more presentations from Pierre Lindenbaum Pierre Legrain (CEA) and Franck Molina, CNRS talking about working in Bioinformatics in the industry and/or academia. (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4976121</comments>
            <pubDate>Mon, 27 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4976121</guid>        </item>
        <item>
            <title>MiSeq Aims for Ion Torrent</title>
            <link>http://www.medworm.com/index.php?rid=4968768&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F06%2Fmiseq-aims-for-ion-torrent.html</link>
            <description>     Illumina has released a MiSeq dataset for E.coli MG1655 on its website.  Accompanying the FASTQ and BAM files is a presentation, the first half of which compares the performance to big brother HiSeq.  The second half is an explicit comparison against the available Ion Torrent. Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4968768</comments>
            <pubDate>Thu, 23 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4968768</guid>        </item>
        <item>
            <title>A first peek at data from an Ion 316 chip</title>
            <link>http://www.medworm.com/index.php?rid=4960247&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F06%2Ffirst-peek-at-data-from-ion-316-chip.html</link>
            <description>      Ion Torrent has made available an E.coli DH10B fragment dataset for the 316 chip, which is expected to be generally available early next month.  I&amp;#39;ve theoretically had access to the data since Saturday, but a series of events (computer-free weekend, a crashed home computer, a personal day &amp; a good but unbloggable conference) have meant that ambitious plans to analyze it are still in progress.  So, what I discuss below is either from Ion&amp;#39;s provided information or other sources, so take it with appropriate caution.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4960247</comments>
            <pubDate>Mon, 20 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4960247</guid>        </item>
        <item>
            <title>Two administrative notes</title>
            <link>http://www.medworm.com/index.php?rid=4960248&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F06%2Ftwo-administrative-notes.html</link>
            <description>Two administrative items:First, sometimes comments are getting moderated very late.  I do pledge this is not an attempt on my part to bury negative commentary.  Rather, it appears that sometimes when I thought I had moderated comments via my Android phone it hasn&amp;#39;t succeeded.   Second, I am trying out the new smartphone-friendly layout option for Blogger.  Please let me know if you are reading this on a smartphone and hate the new layout.Read more » (Source: Omics! Omics!)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4960248</comments>
            <pubDate>Mon, 20 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4960248</guid>        </item>
        <item>
            <title>Woman drowned in molten chocolate</title>
            <link>http://www.medworm.com/index.php?rid=4953232&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fperformance%2Fwoman-drowned-in-molten-chocolate</link>
            <description>Oh, to be a performance artist in NYC. 

	Beguiling mix of textures, facial transmorgification, guttural colors, and drowning. Cocoally good:

Chocolate, 2010 from MARTYNKA WAWRZYNIAK on Vimeo. (Source: Trapped in the USA)</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4953232</comments>
            <pubDate>Mon, 20 Jun 2011 13:37:50 +0100</pubDate>
            <guid isPermaLink="false">4953232</guid>        </item>
        <item>
            <title>A couchdb-like application using apache JETTY and BerkeleyDB. My notebook</title>
            <link>http://www.medworm.com/index.php?rid=4953231&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fcouchdb-like-application-using-apache.html</link>
            <description>This is my notebook for the Jetty API. Jetty provides an Web server and javax.servlet container. It can be embedded in devices, tools, frameworks, application servers, and clusters. I find it more easier for developping and testing a new web application as it doesn't require some deployement descriptors, some configuration files, a web container, etc... like in apache tomcat. As I will show below (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4953231</comments>
            <pubDate>Sat, 18 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4953231</guid>        </item>
        <item>
            <title>E.coli Outbreak Genomics</title>
            <link>http://www.medworm.com/index.php?rid=4934665&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F06%2Fecoli-outbreak-genomics.html</link>
            <description>     The E.coli outbreak in Germany continues to be a major news item.  It is looking increasingly doubtful that the source of the infection will be conclusively traced, as the German authorities have already named and then backed off two suspects, Spanish cucumbers and German bean sprouts.  These activities have not been without repercussions; Spanish agriculture has been hard hit and exports of European produce in general are reportedly hurting.     On the genomics front, the outbreak has demonstrated how quickly bacterial genomics can be run on the current class of instrumentation.  BGI Europe knocked off the sequence in a series of Ion Torrent runs in 3 days, and a group at University of Muenster worked at similar speed with the Ion setup as well.  Later, sequences have come ...</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4934665</comments>
            <pubDate>Tue, 14 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4934665</guid>        </item>
        <item>
            <title>Sorting and Joining some Genotypes with Hadoop.</title>
            <link>http://www.medworm.com/index.php?rid=4934667&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fsorting-and-joining-some-genotypes-with.html</link>
            <description>This post describes how I've used hadoop to merge two large files produced by our Affymetrix-Axiom Genotyping platform.(Note: working with the last version of hadoop has been a headache: it contains some classes with the same name: ('org.apache.hadoop.mapreduce.Reducer' and 'org.apache.hadoop.mapred.Reducer' ), many classes have been deprecated ( org.apache.hadoop.mapred.JobConf ) but they are (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4934667</comments>
            <pubDate>Tue, 14 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4934667</guid>        </item>
        <item>
            <title>XSLT+SVG= &quot;It Came from Outer Space&quot; (anaglyphs)</title>
            <link>http://www.medworm.com/index.php?rid=4934668&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fxsltsvg-it-came-from-outer-space.html</link>
            <description>&quot;It Came from Outer Space&quot;, Universal's first film to be filmed in 3-D.Just like with the &quot;Gimp&quot;, Inkscape , a SVG editor, can manage some layers:Each layer will appear in the SVG document as an element svg:g with an attribute: inkscape:groupmode=&quot;layer&quot; under the root element svg:svg. I've written a XSLT stylesheet that converts the SVG image to an Anaglyph image. The stylesheet is available on (Source: YOKOFAKUN)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4934668</comments>
            <pubDate>Sun, 12 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4934668</guid>        </item>
        <item>
            <title>Startups, tenure and real-world systems</title>
            <link>http://www.medworm.com/index.php?rid=4934666&amp;cid=d_132_132_f&amp;fid=35011&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fmndoci%2F%7E3%2FYfHJ5LxKF7I%2F</link>
            <description>Tweet	Earlier today I visited DBMS Musings to read Daniel Abadi&amp;#8217;s blog post on peer review, but ended up re-reading an older post. In the post Daniel talks about why he is doing a startup pre-tenure. I am not an academic, neither have I ever started a company, although I&amp;#8217;ve been in both environments, so I can&amp;#8217;t quite speak from experience here, but the post caught my attention at multiple levels. What makes this interesting is that Daniel has spent time on getting his company off the ground himself, with active involvement, and the resulting leave of absence limits his ability to do things junior faculty are expected to do, like publish. He admits that there is little precedent and that it&amp;#8217;s a gamble. So why go on?
	Admittedly, this is computer science, but the key ...</description>
            <author>business|bytes|genes|molecules</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4934666</comments>
            <pubDate>Sun, 12 Jun 2011 06:44:50 +0100</pubDate>
            <guid isPermaLink="false">4934666</guid>        </item>
        <item>
            <title>An extension for Inkscape processing a sequence from Genbank</title>
            <link>http://www.medworm.com/index.php?rid=4921654&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fextension-for-inkscape-processing.html</link>
            <description>Let's create a new extension for inkscape by recycling an old XSLT stylsheet producing a SVG image ! For example, we're going to use this xslt stylsheet: https://github.com/lindenb/xslt-sandbox/blob/master/stylesheets/bio/ncbi/gb2svg.xsl that transforms a Genbank XML INSDSeq to SVG. It is downloaded into the extension directory; ${HOME}/.config/inkscape/extensions/gb2svg.xslNow let's tell (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4921654</comments>
            <pubDate>Thu, 09 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4921654</guid>        </item>
        <item>
            <title>Vectorial drawing with the UCSC Genome browser and inkscape</title>
            <link>http://www.medworm.com/index.php?rid=4921655&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fvectorial-drawing-with-ucsc-genome.html</link>
            <description>Select a view in the UCSC genome browser. Export it (top menu) it as Encapsulated Postscript (EPS)Import the EPS file in inkscape'Ungroup' the objects and have fun.That's it.Pierre (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4921655</comments>
            <pubDate>Thu, 09 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4921655</guid>        </item>
        <item>
            <title>Handling a BigWig file from Java+JNI : my notebook</title>
            <link>http://www.medworm.com/index.php?rid=4911739&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fhandling-bigwig-file-from-javajni-my.html</link>
            <description>In a previous post I've shown how to use a Java JNI interface to control a NCURSES panel from a java program.I'll show here how to use java JNI to call the C methods developped by Jim Kent's team at UCSC to handle a BigWig file.The Makefile for compiling the following sources is defined below:Export the LD_LIBRARY_PATH:export LD_LIBRARY_PATH=/path/to/current/directoryDownload Jim Kent's sources (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4911739</comments>
            <pubDate>Mon, 06 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4911739</guid>        </item>
        <item>
            <title>If you have too much data, then “good enough” is good enough</title>
            <link>http://www.medworm.com/index.php?rid=4902616&amp;cid=d_132_132_f&amp;fid=35011&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fmndoci%2F%7E3%2FUPd_4dJzp1g%2F</link>
            <description>Tweet	
	I would suggest that all my friends in the world of bioinformatics read this fabulous article by Pat Helland. Pat&amp;#8217;s on of the leading experts in distributed transactions and knows more about databases than most of us put together. His ACM article goes into some the tradeoffs and changes in mindset that need to me made when working with data that changes and comes from different sources, and all so o ften has ambiguity associated with it. It also tells you a little but about the differences in SQL and NoSQL systems when it comes to transaction semantics and in a way that meets complete sense. 
	Perhaps the most interesting part of the article was the section on &amp;#8220;Mulligan stew&amp;#8221; where we also provides the example of building a heterogeneous catalog. A product catalog...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>business|bytes|genes|molecules</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4902616</comments>
            <pubDate>Mon, 06 Jun 2011 19:16:36 +0100</pubDate>
            <guid isPermaLink="false">4902616</guid>        </item>
        <item>
            <title>The Arsenic Bug, Revisited</title>
            <link>http://www.medworm.com/index.php?rid=4902615&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F06%2Farsenic-bug-revisited.html</link>
            <description>As has been covered in many outlets, Science has released eight technical comments on the arsenic-loving bacterium issue along with a response by the authors, which I commented on when it appeared.  A good summary of the comments can be found at In The Pipeline, and Fejes.ca has a good take on it as well.  The quick summary is that various technical and theoretical issues are presented by the critics, and the NASA team dismisses all of them by standing by their data and interpretations.  Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4902615</comments>
            <pubDate>Sat, 04 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4902615</guid>        </item>
        <item>
            <title>Pubmed: sorting the articles on the number of times they've been cited</title>
            <link>http://www.medworm.com/index.php?rid=4902617&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F06%2Fpubmed-sorting-articles-on-number-of.html</link>
            <description>In 2008 I used www.eigenfactor.org/ to sort a set of Pubmed articles on the impact factor of the journal. In the current post I will show I've used NCBI ELink to sort the articles on the number of times they've have been cited in some other articles in pubmed-central.The NCBI ELink API checks for the existence of an external or  link from a list of one or more primary IDs. It can (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4902617</comments>
            <pubDate>Fri, 03 Jun 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4902617</guid>        </item>
        <item>
            <title>The Pain: Installing Matplotlib in Snow Leopard on a Mac</title>
            <link>http://www.medworm.com/index.php?rid=4893761&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fscience%2Fthe-pain-installing-matplotlib-in-snow-leopard-on-a-mac</link>
            <description>I&amp;#8217;ve said it before, and I&amp;#8217;ll say it again, python combined with matplotlib is one of the most potent tools for scientific computation. But gee, why does it have to be so hard to install it on my Macbook with Snow Leopard?

	I&amp;#8217;ve finally figured it out how to install it on my computer with some measure of understanding. So here are some tips.

	If you really want to get it working RIGHT NOW, I suggest you use macports, and just do

sudo port install py26-matplotlib


	which will install a separate version of python2.6 in the &amp;#8216;/opt/local&amp;#8217; directory. However, everything will work. The down side of this approach is that other packages that you install through a .dmg will fail to find that version of python installed in &amp;#8216;/opt/local&amp;#8217;. Alas, the standard...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4893761</comments>
            <pubDate>Fri, 03 Jun 2011 06:58:46 +0100</pubDate>
            <guid isPermaLink="false">4893761</guid>        </item>
        <item>
            <title>agile redUX</title>
            <link>http://www.medworm.com/index.php?rid=4893760&amp;cid=d_132_132_f&amp;fid=35027&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Finforbiomatica%2F%7E3%2F-YIiBPscBF0%2F</link>
            <description>I blogged (more) recently (than here) at balancedteam.org on agile redUX: http://www.balancedteam.org/2010/12/20/agile-redux/ (Source: Inforbiomatica)</description>
            <author>Inforbiomatica</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4893760</comments>
            <pubDate>Thu, 02 Jun 2011 17:39:51 +0100</pubDate>
            <guid isPermaLink="false">4893760</guid>        </item>
        <item>
            <title>Finding the truth is a waste of time, scientists say</title>
            <link>http://www.medworm.com/index.php?rid=4893759&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F06%2F02%2Ffinding-the-truth-is-a-waste-of-time-scientists-say%2F</link>
            <description>Before bioinformatics, I worked in both biochemistry and microbiology labs, including a stint in the field of extremophile biology. So like many other people, I&amp;#8217;ve been following the &amp;#8220;arsenic life&amp;#8221; story with great interest.
It seems that far more has been written about the publication, the manner of its announcement, the ensuing online debate and the personalities involved than about the principal scientific question: can arsenate substitute for phosphate in biological molecules? My opinion, for what it&amp;#8217;s worth, is that the Science paper presented no compelling evidence for covalently-bound As in DNA and that the editors should have asked the authors either to do better experiments or tone it down.
Now, Nature News reports that the criticisms are in, the authors ha...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4893759</comments>
            <pubDate>Thu, 02 Jun 2011 02:29:42 +0100</pubDate>
            <guid isPermaLink="false">4893759</guid>        </item>
        <item>
            <title>Playing with the Gimp server: my notebook</title>
            <link>http://www.medworm.com/index.php?rid=4883815&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F05%2Fplaying-with-gimp-server-my-notebook.html</link>
            <description>In the following post, I'll show how the Gimp can be used as a server that will read and execute some statements in the scheme programming language. The scheme script will be used to downlad, resize and filter an image from the web.Starting the Gimp ServerGo in menu Filters→Script-Fu→Start ServerThe original image(3156 x 2538)Barbara McClintock in the lab at Cold Spring Harbor in 1947The (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4883815</comments>
            <pubDate>Mon, 30 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4883815</guid>        </item>
        <item>
            <title>Computing and abstractions revisited</title>
            <link>http://www.medworm.com/index.php?rid=4883814&amp;cid=d_132_132_f&amp;fid=35011&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fmndoci%2F%7E3%2FDCMLe-cIKq4%2F</link>
            <description>Tweet	Computing is critical. 
	Computing is not important.
	Two seeming contradictory statements, but I&amp;#8217;ll point to Google. Their success is built on computing, lots of it. The end user doesn&amp;#8217;t care. They get information back. They don&amp;#8217;t have to set up clusters and queries. They just search. While that&amp;#8217;s a gross oversimplication for science, there are aspects of computing productivity that just don&amp;#8217;t make sense anymore. I was reminded of this point by a question on Biostar about setting up MPI. There are different level of skill sets required and I might be one of the first to argue that a computational scientist should understand their tool, the computer. But at what level? In the end I love abstractions and productivity. The more you have to spend time setti...</description>
            <author>business|bytes|genes|molecules</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4883814</comments>
            <pubDate>Sun, 29 May 2011 17:05:22 +0100</pubDate>
            <guid isPermaLink="false">4883814</guid>        </item>
        <item>
            <title>Cancer classification using Google prediction API</title>
            <link>http://www.medworm.com/index.php?rid=4876475&amp;cid=d_132_132_f&amp;fid=35036&amp;url=http%3A%2F%2Fcomputationalbiologynews.blogspot.com%2F2011%2F05%2Fcancer-classification-using-google.html</link>
            <description>Google's prediction API is now available to everyone with nice tutorial on how to get going :) so I decided to check its performance using gene expression values for 5 selected genes using online-feature-selection from the classic AML/ALL dataset (training/test).The dataset needed to be formatted (training/test) and copied to Google storage to get the default scripts working. Coming to the results, well the training step:./oauth-train.sh mlani/top5_tr.csvgave a performance of about 3/4:./oauth-check-training.sh mlani/top5_tr.csv{&amp;nbsp;&quot;kind&quot;: &quot;prediction#training&quot;,&amp;nbsp;&quot;id&quot;: &quot;mlani/top5_tr.csv&quot;,&amp;nbsp;&quot;selfLink&quot;: &quot;https://www.googleapis.com/prediction/v1.2/training/mlani/top5_tr.csv&quot;,&amp;nbsp;&quot;modelInfo&quot;: {&amp;nbsp; &quot;modelType&quot;: &quot;classification&quot;,&amp;nbsp; &quot;classificationAccuracy&quot;: 0.75&amp;nbsp;},&amp;nbsp...</description>
            <author>Bioinformatics Latest News</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4876475</comments>
            <pubDate>Sun, 29 May 2011 12:42:00 +0100</pubDate>
            <guid isPermaLink="false">4876475</guid>        </item>
        <item>
            <title>Purity in the atomic force-fields of molecular dynamics simulations</title>
            <link>http://www.medworm.com/index.php?rid=4876474&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fprotein%2Fpurity-in-the-atomic-force-fields-of-molecular-dynamics-simulations</link>
            <description>In the protein modeling community, there is a split between those who pursue a more &amp;#8220;physics-based&amp;#8221; approach, and those pursue a more &amp;#8220;empirical-based&amp;#8221; approach. Nothing highlights this better than the acrimonious debate between atomic force-fields and statistical potentials. These force-fields are used in everything from protein folding studies, ligand-binding studies and protein-protein interaction studies.

	From a brutally pragmatic point of view, the successes in protein folding and homology modeling show that empirical force-fields have galloped far ahead. Still, the purists leer down on empirical force-fields, as they pray themselves to sleep at night that the test of time will expose empirical force-fields for the dirty statistic aberrations that they are.

...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4876474</comments>
            <pubDate>Sun, 29 May 2011 05:56:51 +0100</pubDate>
            <guid isPermaLink="false">4876474</guid>        </item>
        <item>
            <title>Cancer classification using Google prediction API</title>
            <link>http://www.medworm.com/index.php?rid=5571393&amp;cid=d_132_132_f&amp;fid=35036&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fblogspot%2FLuPcC%2F%7E3%2FUIYp-lyA8K0%2Fcancer-classification-using-google.html</link>
            <description>Google's prediction API is now available to everyone with nice tutorial on how to get going :) so I decided to check its performance using gene expression values for 5 selected genes using online-feature-selection from the classic AML/ALL dataset (training/test).
The dataset needed to be formatted (training/test) and copied to Google storage to get the default scripts working. 
Coming to the results, well the training step:
./oauth-train.sh mlani/top5_tr.csvgave a performance of about 3/4:
./oauth-check-training.sh mlani/top5_tr.csv
{
&amp;nbsp;&quot;kind&quot;: &quot;prediction#training&quot;,
&amp;nbsp;&quot;id&quot;: &quot;mlani/top5_tr.csv&quot;,
&amp;nbsp;&quot;selfLink&quot;: &quot;https://www.googleapis.com/prediction/v1.2/training/mlani/top5_tr.csv&quot;,
&amp;nbsp;&quot;modelInfo&quot;: {
&amp;nbsp; &quot;modelType&quot;: &quot;classification&quot;,
&amp;nbsp; &quot;classificationAccuracy&quot;: 0.75
&amp;...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Bioinformatics Latest News</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5571393</comments>
            <pubDate>Sun, 29 May 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5571393</guid>        </item>
        <item>
            <title>Data, software, and money</title>
            <link>http://www.medworm.com/index.php?rid=4876473&amp;cid=d_132_132_f&amp;fid=35011&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fmndoci%2F%7E3%2FNumKmYkZXMY%2F</link>
            <description>Tweet	
Steve O&amp;#8217;Grady has written a blog post about a recent talk he gave at OSBC. In the post he welcomes the Age of Data. The talk covers two topics of great interest, software and data. In the context of the life sciences I have worked on both the &amp;#8220;data as a product&amp;#8221; side and on the packaged software side. He notes that none of the top &amp;#8220;software&amp;#8221; companies in the world are of recent vintage. These are companies making money from selling software (a really difficult business in the sciences). He argues that data driven products is where the market is. The success of Google and others is a testament to this, but in the sciences the entire model of data as product has never worked. I would argue that this is partly cause we&amp;#8217;ve always sold the data itself ...</description>
            <author>business|bytes|genes|molecules</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4876473</comments>
            <pubDate>Sun, 29 May 2011 00:38:14 +0100</pubDate>
            <guid isPermaLink="false">4876473</guid>        </item>
        <item>
            <title>Paying a Painful 75% Secrecy Tax</title>
            <link>http://www.medworm.com/index.php?rid=4872342&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Fpaying-painful-75-secrecy-tax.html</link>
            <description>In a post a while back, I mentioned that my Ion Torrent sequencing project was stalled because my service provider couldn&amp;#39;t get some of the key kits, despite an Ion representative posting that no such shortages existed.  I&amp;#39;ve been remiss in updating that; last Tuesday the kits showed up and Monday I got my data -- and a bit of a shock.Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4872342</comments>
            <pubDate>Wed, 25 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4872342</guid>        </item>
        <item>
            <title>Predicting kinase specificity from phosphorylation data</title>
            <link>http://www.medworm.com/index.php?rid=4862805&amp;cid=d_132_132_f&amp;fid=35013&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fpedrobeltrao%2F%7E3%2Fok_OCpJmHcw%2Fpredicting-kinase-specificity-from.html</link>
            <description>Over the past few years, improvements in mass-spectrometry methods have resulted in a big increase in throughput for the identification of post-translational modifications (PTMs). It is even hard to keep up with all the phosphoproteomics papers and the accumulation of phosphorylation data. Most often, improvements in methods result in interesting&amp;nbsp;challenges&amp;nbsp;and&amp;nbsp;opportunities. In this case, how can we make use of this explosion in PTM data ? I will try to explore a fairly straightforward idea, on how to use phosphorylation data to predict kinase substrate specificity. I'll describe here the general idea and just the first stab at it to show that I think it can work.

The inspiration for this is the work by Neduva and colleagues that have show that we can search for enriched m...</description>
            <author>Public Rambling</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4862805</comments>
            <pubDate>Wed, 25 May 2011 17:37:00 +0100</pubDate>
            <guid isPermaLink="false">4862805</guid>        </item>
        <item>
            <title>MiSeq's First Light</title>
            <link>http://www.medworm.com/index.php?rid=4862804&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Fmiseqs-first-light.html</link>
            <description>Someone was kind enough to send me a copy of a poster by Illumina reporting results from the MiSeq.  Now, to be very upfront, by someone I mean &amp;quot;a person from a PR firm contracted by Illumina&amp;quot; and &amp;quot;kind enough&amp;quot; that she was doing her job.  I don&amp;#39;t have illusions about motive here, the author list is all from Illumina or Epicentre, but this was a poster presented at the recent Cold Spring Harbor Biology of Genomes meeting.  It certainly isn&amp;#39;t peer-reviewed data, but it is something. Of course, we can&amp;#39;t know to what degree these are cherry-picked results.  If you want to be really cynical, call it messaging and not data.  Yes, I&amp;#39;ve taken some flak in the comments recently about how favorable my coverage of Ion has been, and I&amp;#39;m trying to adjust. ...</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4862804</comments>
            <pubDate>Sun, 22 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4862804</guid>        </item>
        <item>
            <title>Jargon: The Art of Naming Things</title>
            <link>http://www.medworm.com/index.php?rid=4853087&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fscience%2Fjargon-the-art-of-naming-things</link>
            <description>I like jargon but jargon, like anything else, can be used for good, or for evil. Whilst there are plenty of writing guides that rail against the use of jargon, such advice would make us miss out on the richness that carefully minted technical terms add to the language. Our language would be poorer without words such as entropy or DNA or tectonic plates.

	Still, an eternally recurring complaint is that academic articles are unreadable because they are riddled with jargon, prose-killing, soul-sucking jargon. But is jargon the real problem? Perhaps it&amp;#8217;s merely the sloppy use of jargon. I don&amp;#8217;t believe that all science writers out there are deliberately using jargon to obfuscate their thoughts. It&amp;#8217;s just that no-one&amp;#8217;s really framed a way for them to use jargon effectiv...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4853087</comments>
            <pubDate>Sun, 22 May 2011 21:47:20 +0100</pubDate>
            <guid isPermaLink="false">4853087</guid>        </item>
        <item>
            <title>PERL one liner to generate 1000 base genome</title>
            <link>http://www.medworm.com/index.php?rid=5571394&amp;cid=d_132_132_f&amp;fid=35036&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fblogspot%2FLuPcC%2F%7E3%2FnXnNg6SNl0g%2Fperl-one-liner-to-generate-1000-base.html</link>
            <description>perl -e '@b=qw/A T G C/;print &quot;&amp;gt;Genome\n&quot;;while($l&amp;lt;1000){print @b[int(rand(4))];$l++;}' (Source: Bioinformatics Latest News)</description>
            <author>Bioinformatics Latest News</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=5571394</comments>
            <pubDate>Sun, 22 May 2011 04:00:00 +0100</pubDate>
            <guid isPermaLink="false">5571394</guid>        </item>
        <item>
            <title>Syntax highlighting of R code at WordPress.com</title>
            <link>http://www.medworm.com/index.php?rid=4848095&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F05%2F20%2Fsyntax-highlighting-of-r-code-at-wordpress-com%2F</link>
            <description>If your WordPress blog is hosted at WordPress.com (like this one), you may know that source code in posts is formatted and highlighted using a shortcode, as explained here.
Until recently, R was not on the list of supported languages (neither was Perl), but I noticed today that both of them are now supported. This may have happened months ago and I just wasn&amp;#8217;t aware of it. I spent a merry hour or so updating my old posts in the R category, so the code should now feature pretty colours. Let me know if you spot any errors that I may have introduced inadvertently.
I guess this is in some sense recognition that R is now used quite widely and that usage is growing.
Filed under: programming, R, statistics, this blog Tagged: shortcode, syntax highlighting, wordpress (Source: What You're Doi...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4848095</comments>
            <pubDate>Fri, 20 May 2011 07:57:46 +0100</pubDate>
            <guid isPermaLink="false">4848095</guid>        </item>
        <item>
            <title>Friday fun with: Google Trends</title>
            <link>http://www.medworm.com/index.php?rid=4848096&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F05%2F20%2Ffriday-fun-with-google-trends%2F</link>
            <description>Some years ago, Google discovered that when people are concerned about influenza, they search for flu-related information and that to some extent, search traffic is an indicator of flu activity. Google Flu Trends was born.
Google Trends: bronchitisIllness is sweeping through our department this week and I have succumbed. It&amp;#8217;s not flu but at one point, I did wonder if my symptoms were those of bronchitis. Remembering Google Flu Trends, I thought I&amp;#8217;d try my query for &amp;#8220;bronchitis&amp;#8221; at Google Trends, where I saw the chart shown at right.
Interesting. Clearly seasonal, peaking around the latest and earliest months of each year. Winter, for those of you in the northern hemisphere.
Next:

select USA and Australia as regions
download the data in CSV format (I chose fixed sca...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4848096</comments>
            <pubDate>Fri, 20 May 2011 01:55:17 +0100</pubDate>
            <guid isPermaLink="false">4848096</guid>        </item>
        <item>
            <title>Forums: Open Beats Closed Hands Down</title>
            <link>http://www.medworm.com/index.php?rid=4848094&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Fforums-open-beats-closed-hands-down.html</link>
            <description>Around the Internet, there are a number of communities in which scientists can swap useful information. &amp;nbsp; SEQAnswers is a very useful site I frequent; BioStar is one I don't but probably should. &amp;nbsp;Life Technologies has set up a community around Ion Torrent, and the contrast between that and SEQAnswers is a useful one.SEQAnswers has a straightforward access policy. &amp;nbsp;Anyone can view content, but to post or reply you must register for a membership. &amp;nbsp;This approach appears to have been very successful, as there is a healthy number of individuals posting to the site. &amp;nbsp;You can browse around and figure out if the site applies, plus search engines such as Google can steer individuals in. &amp;nbsp;SEQAnswers boasts a number of authors of major second generation sequencing analys...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4848094</comments>
            <pubDate>Wed, 18 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4848094</guid>        </item>
        <item>
            <title>How to: create a partial UCSC genome MySQL database</title>
            <link>http://www.medworm.com/index.php?rid=4841830&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F05%2F18%2Fhow-to-create-a-partial-ucsc-genome-mysql-database%2F</link>
            <description>File under: simple, but a useful reminder
UCSC Genome Bioinformatics is one of the go-to locations for genomic data. They are also kind enough to provide access to their MySQL database server:

mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A

However, users are given fair warning to &amp;#8220;avoid excessive or heavy queries that may impact the server performance.&amp;#8221; It&amp;#8217;s not clear what constitutes excessive or heavy but if you&amp;#8217;re in any doubt, it&amp;#8217;s easy to create your own databases locally. It&amp;#8217;s also easy to create only the tables that you require, as and when you need them.
As an example, here&amp;#8217;s how you could create only the ensGene table for the latest hg19 database. Here, USER and PASSWD represent a local MySQL user and password with full privileg...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4841830</comments>
            <pubDate>Wed, 18 May 2011 04:54:10 +0100</pubDate>
            <guid isPermaLink="false">4841830</guid>        </item>
        <item>
            <title>Friday fun projects</title>
            <link>http://www.medworm.com/index.php?rid=4821061&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F05%2F14%2Ffriday-fun-projects%2F</link>
            <description>What&amp;#8217;s a &amp;#8220;Friday fun project&amp;#8221;? It&amp;#8217;s a small computing project, perfect for a Friday afternoon, which serves the dual purpose of (1) keeping your programming/data analysis skills sharp and (2) providing a mental break from the grind of your day job. Ideally, the skills learned on the project are useful and transferable to your work projects.
This post describes one of my Friday fun projects: how good is the Sydney weather forecast? There&amp;#8217;ll be some Ruby, some R and some MongoDB. Bioinformatics? No, but some thoughts on that at the end.

1. Define the problem
Step 1 in any project is to define the problem. The question &amp;#8220;how good is a weather forecast?&amp;#8221; might be addressed in many ways. Obviously, we need to collect two kinds of data: predictions and o...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4821061</comments>
            <pubDate>Sat, 14 May 2011 06:13:46 +0100</pubDate>
            <guid isPermaLink="false">4821061</guid>        </item>
        <item>
            <title>Oh Would It Be Fun to Debug Strobe Sequencing!</title>
            <link>http://www.medworm.com/index.php?rid=4829169&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Foh-would-it-be-fun-to-debug-strobe.html</link>
            <description>Through the course of a month, I easily sketch out (on average) a dozen plus experimental designs in the context of my day job.  Of course, only a very few of these are ever executed; many never get shown to another soul.  By constantly pondering how I might tackle questions, I can keep in practice and present plans which are reasonably well thought-out.  It also helps to have thought through things; sometimes a rejected plan will suddenly look like a gem due to some other result or change in priorities.Atop that, I also sketch up a few which have nothing to do with my day job.  Partly it is fun, and partly it is a way to exercise the process on areas I can be certain of my objectivity.  A related exercise is to sketch out a business plan; I end up doing that a few times a year.  Nev...</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4829169</comments>
            <pubDate>Fri, 13 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4829169</guid>        </item>
        <item>
            <title>Blogger glitch</title>
            <link>http://www.medworm.com/index.php?rid=4821060&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Fblogger-glitch.html</link>
            <description>Blogger experienced an outage which I found out about today. &amp;nbsp;This caused my last post to disappear; any posts or comments in a certain time period were lost.Thanks go out to the anonymous reader who posted a comment wondering where my last post went. &amp;nbsp;Blogger (Google) did not notify me of the issue. &amp;nbsp;The really disconcerting part is the Google cached version of the post was missing as well -- this suggested something very odd going on.I often write posts directly in Blogger, so I don't have a backup. &amp;nbsp;This was an exception &amp; I did actually have the rough draft, which I would have posted if the old one wasn't restored. (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4821060</comments>
            <pubDate>Thu, 12 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4821060</guid>        </item>
        <item>
            <title>Ion's Growing Pains</title>
            <link>http://www.medworm.com/index.php?rid=4813548&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Fions-growing-pains.html</link>
            <description>A recent In Sequence article indicated that Ion Torrent is enjoying strong initial sales.  This bodes well for continued evolution and improvement of the technology, as LIFE will continue to smell revenues and opportunity.  Ion has announced a number of improvements, but most aren&amp;#39;t scheduled to arrive until the near future.The challenge is for LIFE to keep executing on their plan.  Already some issues have arisen; my own Ion experiment at a service provider is stalled due to a back-ordered template prep reagent (two weeks and counting!). This is a key reason to do pilots on emerging technologies with non-critical (but interesting) samples; I was bitten last fall by another backorder bug (that time, the paired end SOLiD reagents).  Of course, this time it is even more complicated, ...&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4813548</comments>
            <pubDate>Tue, 10 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4813548</guid>        </item>
        <item>
            <title>Analyzing exome data with KNIME</title>
            <link>http://www.medworm.com/index.php?rid=4813549&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F05%2Fanalyzing-exome-data-with-knime.html</link>
            <description>A presentation I gave last week in my lab Analyzing Exome Data with KNIME  View more presentations from Pierre Lindenbaum (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4813549</comments>
            <pubDate>Tue, 10 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4813549</guid>        </item>
        <item>
            <title>Towards mastery: deliberative practice, flow and personality traits</title>
            <link>http://www.medworm.com/index.php?rid=4803393&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fbooks%2Ftowards-mastery-deliberative-practice-flow-and-personality-traits</link>
            <description>By now, you&amp;#8217;ve probably come across the factoid that you need 10,000 hours of practice to master any activity (thanks Malcolm Gladwell). Although it&amp;#8217;s good that people are finally getting over the idea of a shortcut to success, this oft-quoted figure of 10000 hours does not really answer the question whether mastery is innate or acquired. After all, plenty of people have plodded away at activities for more than 10000 hours without getting to any recognizable level of mastery. It&amp;#8217;s a topic that I&amp;#8217;ve read around a lot in and here I&amp;#8217;d like to throw down some connections between practice, neural plasticity, flow and innate psychological traits.

	Deliberative practice

	In &amp;#8220;Talent is Overrated,&amp;#8221; Geoff Colvin argues that it&amp;#8217;s not just any kind of ...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4803393</comments>
            <pubDate>Mon, 09 May 2011 13:53:27 +0100</pubDate>
            <guid isPermaLink="false">4803393</guid>        </item>
        <item>
            <title>Ion Torrent's Data Quality Is Pretty Good (and Better Than Ion Claims)</title>
            <link>http://www.medworm.com/index.php?rid=4803392&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Fion-torrents-data-quality-is-pretty.html</link>
            <description>One of the key questions around Ion Torrent, as with all new platforms, is what is the sequence data quality like.  Now, that can be a loaded question but I&amp;#39;ll ask a slight variant on it: how truthful (or accurate) is Ion at estimating base quality?Quality scores are a useful adjunct to sequencing data and are commonly expressed as phred scores, which are the integer part of -10*log10 of the error probability.  Any base caller needs to estimate these and many downstream programs, from aligners to assemblers to variant callers, rely on these quality values for their operations.  In many cases, the individual quality scores are combined to generate some joint estimate of the error (I built one such model at Codon).  These error probabilities come not from an infallible source, but ar...</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4803392</comments>
            <pubDate>Sat, 07 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4803392</guid>        </item>
        <item>
            <title>Which numbers did I use this time?</title>
            <link>http://www.medworm.com/index.php?rid=4797872&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Fwhich-numbers-did-i-use-this-time.html</link>
            <description>Noted screenwriter William Goldman&amp;#39;s second memoir on the film industry is titled &amp;quot;Which Lie Did I Tell?&amp;quot;.  The title is not a quote from Goldman, but rather what another movie industry said after getting off a long phone call he had taken in Goldman&amp;#39;s presence.  I&amp;#39;m a bit nervous I inadvertently strayed in that direction in my last item on Pacific Biosciences.I was relying on memory for my numbers &amp; in the course of writing things I think I also revised those downwards, not out of malice but rather an attempt to be conservative.  As a correspondent pointed out, the first pass accuracy on the first commercial system is claimed to be 85%, not 80% as I stated.  On the number of reads, I failed to update for the newer SMRT cells which are out; I said 10K and it...</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4797872</comments>
            <pubDate>Fri, 06 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4797872</guid>        </item>
        <item>
            <title>Changes</title>
            <link>http://www.medworm.com/index.php?rid=4789503&amp;cid=d_132_132_f&amp;fid=35011&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fmndoci%2F%7E3%2F1Hd9KipioOo%2F</link>
            <description>Tweet	
	As some you have probably noticed via a prior tweet and a LinkedIn profile change, I have a new role at AWS, getting back to my product management roots. This gives me a chance to get up close and personal with infrastructure, thinking about the types of applications and workloads people are running, high performance computing, etc. I love product, and it&amp;#8217;s good to be back. Will definitely mean a reduction in travel and speaking engagements, but mostly result in being more picky about those. Maybe this means that c2cbio will make a semi-comeback. We&amp;#8217;ll see.  In the meantime, I am excited. (Source: business|bytes|genes|molecules)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>business|bytes|genes|molecules</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4789503</comments>
            <pubDate>Fri, 06 May 2011 03:02:36 +0100</pubDate>
            <guid isPermaLink="false">4789503</guid>        </item>
        <item>
            <title>Books Read March/April 2011</title>
            <link>http://www.medworm.com/index.php?rid=4780438&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fbooks%2Fbooks-read-marchapril-2011</link>
            <description>9. Rino Breerbaart, &amp;#8220;Song Logic&amp;#8221;

	Reading &amp;#8220;Song-logic&amp;#8221;, a book of essays of music by Rino Breebaart, I felt a slight twinge of jealousy. Ever since I can remember, I have always wanted to write about my own transcendent experiences in music listening. In these hypothetical essays, I would break down my experiences with stylish wit and irresistible acumen. That I haven&amp;#8217;t is clear, but if I did then they would look something like these essays. 

	To write great music criticism, you have to have a sharp ear, a theorizing bent and a poetic tongue. But most of all, you need a sympathetic heart. These essays are written in a mature tone, suffused with a generosity that is absent in much criticism out there. There is no snark, just an honest attempt at trapping the ...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4780438</comments>
            <pubDate>Tue, 03 May 2011 12:11:33 +0100</pubDate>
            <guid isPermaLink="false">4780438</guid>        </item>
        <item>
            <title>How Many More Machines will PacBio Sell This Year?</title>
            <link>http://www.medworm.com/index.php?rid=4775538&amp;cid=d_132_132_f&amp;fid=35003&amp;url=http%3A%2F%2Fomicsomics.blogspot.com%2F2011%2F05%2Fhow-many-more-machines-will-pacbio-sell.html</link>
            <description>Amongst last week&amp;#39;s news is the item that Pacific Biosciences has officially launched their SMRT sequencing platform.  I&amp;#39;d too eye-deep in various projects to figure out how off schedule that is (I think nearly a year from their original target), but now it is launched.  Read more » (Source: Omics! Omics!)</description>
            <author>Omics! Omics!</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4775538</comments>
            <pubDate>Sun, 01 May 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4775538</guid>        </item>
        <item>
            <title>In defense of 'Omics</title>
            <link>http://www.medworm.com/index.php?rid=4762896&amp;cid=d_132_132_f&amp;fid=35013&amp;url=http%3A%2F%2Ffeedproxy.google.com%2F%7Er%2Fpedrobeltrao%2F%7E3%2FfqX8l_F68Ng%2Fin-defense-of-omics.html</link>
            <description>High-throughput&amp;nbsp;studies tend to have a bad reputation. They are often derided as little more than fishing expeditions. Few have summarized these feelings as sharply as Sydney Brenner:
&quot;So we now have a culture which is based on everything must be high-throughput.I like to call it low-input, high-throughput, no-output biology&quot;Having dealt with these type of data for so long, I am often in the strange&amp;nbsp;position of having to defend the approaches. As I was in a real need to procrastinate, I decide to try to write some of these thoughts down.

Error rates
One of the&amp;nbsp;biggest&amp;nbsp;complaints directed at large-scale methods is that they have very high error rates. Usually these complaints come from scientists interested in studying system X or protein Y, that dig into these datasets...</description>
            <author>Public Rambling</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4762896</comments>
            <pubDate>Thu, 28 Apr 2011 21:29:00 +0100</pubDate>
            <guid isPermaLink="false">4762896</guid>        </item>
        <item>
            <title>Multiple Widget Embedding for Jolecule</title>
            <link>http://www.medworm.com/index.php?rid=4758907&amp;cid=d_132_132_f&amp;fid=35030&amp;url=http%3A%2F%2Fboscoh.com%2Fprotein%2Fmultiple-widget-embedding-for-jolecule</link>
            <description>Finally, I've got the Jolecule widgets working properly. That is, multiple widgets working on the same page. That took some fugly global-variable name-mangling meta-programming code. But hey, it's javascript, so you are supposed to write fugly code.



What this means is that you can create annotated views for different proteins, then hot-link it back on your blog/website/journal-article to have fast loading interactive protein widgets. 



I've also added a handy &quot;link&quot; button below every text box of the annotated views, where you can grab the appropriate embedding HTML code for your annotated views. 






If you install multiple widgets, there will be some renaming to be had. If you are embedding the second widget, you will have to change:


 embed1_dict =&gt; embed2_dict...</description>
            <author>Trapped in the USA</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4758907</comments>
            <pubDate>Thu, 28 Apr 2011 02:27:49 +0100</pubDate>
            <guid isPermaLink="false">4758907</guid>        </item>
        <item>
            <title>mapping people I'm following on twitter: (KML, java, geonames.org)</title>
            <link>http://www.medworm.com/index.php?rid=4762897&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F04%2Fmapping-people-im-following-on-twitter.html</link>
            <description>I wrote a java tool to map the people I'm following on twitter. This tool invokes the twitter API to fetch the profiles of my contacts and it uses the geonames web services to guess the geolocation of the places.The source code is available on github at https://github.com/lindenb/jsandbox/blob/master/src/sandbox/TwitterToKML.java.the build.xml is here.Compilationant twitterkml#get your twitter-id (Source: YOKOFAKUN)&lt;div id=&quot;medworm&quot;&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;MedWorm Message:&lt;/i&gt;&lt;/b&gt; Register for&lt;b&gt;&lt;a href=&quot;http://www.medmatcha.com&quot; target =&quot;_self&quot;&gt;MedMatcha, MedWorm's medical advertising network&lt;/a&gt;&lt;/b&gt;, and receive $5 free advertising.&lt;/p&gt;&lt;/div&gt;</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4762897</comments>
            <pubDate>Wed, 27 Apr 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4762897</guid>        </item>
        <item>
            <title>dbNSFP: a lightweight database of human non-synonymous SNPs and their  functional predictions</title>
            <link>http://www.medworm.com/index.php?rid=4762898&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F04%2Fdbnsfp-lightweight-database-of-human.html</link>
            <description>People from the &quot;Human Genetics Center&quot; in Houston have compiled a new resource named dbNSFP and described in http://www.ncbi.nlm.nih.gov/pubmed/21520341.Hum Mutat. 2011 Apr 21. doi:10.1002/humu.21517.dbNSFP: a lightweight database of human non-synonymous SNPs and their functional predictions. Liu X, Jian X, Boerwinkle E.They have compiled the &quot;prediction scores from four new and popular (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4762898</comments>
            <pubDate>Wed, 27 Apr 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4762898</guid>        </item>
        <item>
            <title>Playing with the HTML5 File API: translating a Fasta file.</title>
            <link>http://www.medworm.com/index.php?rid=4742593&amp;cid=d_132_132_f&amp;fid=35022&amp;url=http%3A%2F%2Fplindenbaum.blogspot.com%2F2011%2F04%2Fplaying-with-html5-file-api-translating.html</link>
            <description>In the current post, I'm using the new HTML5 File Api. This new API can read the content of a file on the client side without needing a remote server. Let me repeat this: YOU DO NOT NEED A SERVERYOU DO NOT NEED TO COPY AND PASTE THE CONTENT OF THE FILE IN A TEXTAREA.As an example, the following code reads a whole DNA fasta file stored on your computer and translate each DNA sequence to a protein. (Source: YOKOFAKUN)</description>
            <author>YOKOFAKUN</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4742593</comments>
            <pubDate>Thu, 21 Apr 2011 23:00:00 +0100</pubDate>
            <guid isPermaLink="false">4742593</guid>        </item>
        <item>
            <title>Lists of URLs are so 1990s</title>
            <link>http://www.medworm.com/index.php?rid=4734450&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F04%2F21%2Flists-of-urls-are-so-1990s%2F</link>
            <description>Subtitle: &amp;#8220;Why some projects are not worth your valuable time and skills.&amp;#8221;
Let&amp;#8217;s wrap up this exploration of how to extract URLs associated with NAR Database articles. I&amp;#8217;m tempted to start with the summary: don&amp;#8217;t bother &amp;#8211; just Google it. If you want that, skip to the end.

First: forget PubMed. This query:

&amp;quot;Nucleic Acids Res&amp;quot;[JOUR] &amp;quot;Database issue&amp;quot;[ISS]

is all well and good except that between 1998 and 2004, the NAR Database Issue was not named &amp;#8220;Database issue&amp;#8221;; it just has a volume number, like the other issues.
If you insist on trying to extract URLs from PubMed abstracts, be prepared for frustration. They include every valid variant imaginable (with/without a leading &amp;#8220;http://&amp;#8221;) and many invalid variants (b...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4734450</comments>
            <pubDate>Wed, 20 Apr 2011 14:08:18 +0100</pubDate>
            <guid isPermaLink="false">4734450</guid>        </item>
        <item>
            <title>Factoids (and using R as a simple calculator)</title>
            <link>http://www.medworm.com/index.php?rid=4734451&amp;cid=d_132_132_f&amp;fid=35006&amp;url=http%3A%2F%2Fnsaunders.wordpress.com%2F2011%2F04%2F20%2Ffactoids-and-using-r-as-a-simple-calculator%2F</link>
            <description>Wikipedia defines factoid as &amp;#8220;a questionable or spurious—unverified, incorrect, or fabricated—statement presented as a fact, but with no veracity.&amp;#8221;
Last night I was enjoying a TV documentary series, The Story of Science, when I heard a startling factoid, namely:

If the &amp;#8220;empty space&amp;#8221; inside the atoms that make up people were removed, the entire human population would fit inside a sugar cube.

What the? Can we improve the veracity of this factoid?

Atoms
Let&amp;#8217;s start with atoms. Assuming that atoms are spherical, with a central nucleus which is also spherical, we can calculate atomic and nuclear volumes. A Google search for average diameter of an atomic nucleus suggests that the &amp;#8220;average&amp;#8221; nuclear diameter is 1e-14 metres, ~ 10 000 times less than...</description>
            <author>What You're Doing Is Rather Desperate</author>
            <type>blogs</type>
        <comments>http://www.medworm.com/rss/comments.php?id=4734451</comments>
            <pubDate>Wed, 20 Apr 2011 02:16:11 +0100</pubDate>
            <guid isPermaLink="false">4734451</guid>        </item>
    </channel>
</rss>

