SciCombinator

Concept: Microsoft

878

The spreadsheet software Microsoft Excel, when used with default settings, is known to convert gene names to dates and floating-point numbers. A programmatic scan of leading genomics journals reveals that approximately one-fifth of papers with supplementary Excel gene lists contain erroneous gene name conversions.
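To make the kind of conversion described above concrete, here is a minimal Python sketch of how a column of gene names might be screened for values that look like dates or floating-point numbers. The regular expressions and the function name `suspicious_entries` are assumptions made for illustration; they are not the scan used in the study, and they only cover common patterns such as SEPT2 becoming "2-Sep" or the identifier 2310009E13 becoming "2.31E+13".

```python
import re

# Month abbreviations Excel uses when it renders a date such as "2-Sep".
_MONTHS = "Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec"

# "2-Sep", "01-Mar" or "Sep-02": what a symbol such as SEPT2 or MARCH1 can become.
DATE_LIKE = re.compile(
    rf"^(?:\d{{1,2}}-(?:{_MONTHS})|(?:{_MONTHS})-\d{{1,2}})$", re.IGNORECASE
)

# "2.31E+13": what an identifier such as 2310009E13 can become.
FLOAT_LIKE = re.compile(r"^\d(?:\.\d+)?E[+-]?\d+$", re.IGNORECASE)


def suspicious_entries(column):
    """Return the cells that look like dates or floats rather than gene names."""
    return [
        cell
        for cell in column
        if DATE_LIKE.match(cell.strip()) or FLOAT_LIKE.match(cell.strip())
    ]
```

Applied to the list `["TP53", "2-Sep", "2.31E+13"]`, the function would flag the last two entries. A flagged cell still needs manual review, since the original symbol cannot always be recovered from the converted value.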

Concepts: VisiCalc, Pivot table, Lotus 1-2-3, Spreadsheet software, Microsoft Office, Microsoft Excel, Spreadsheet, Microsoft

204

The choice of an efficient document preparation system is an important decision for any academic researcher. To assist the research community, we report a software usability study in which 40 researchers across different disciplines prepared scholarly texts with either Microsoft Word or LaTeX. The probe texts included simple continuous text, text with tables and subheadings, and complex text with several mathematical equations. We show that LaTeX users were slower than Word users, wrote less text in the same amount of time, and produced more typesetting, orthographical, grammatical, and formatting errors. On most measures, expert LaTeX users performed even worse than novice Word users. LaTeX users, however, more often report enjoying using their respective software. We conclude that even experienced LaTeX users may suffer a loss in productivity when LaTeX is used, relative to other document preparation systems. Individuals, institutions, and journals should carefully consider the ramifications of this finding when choosing document preparation strategies, or requiring them of authors.

Concepts: Microsoft, Text editor, Microsoft Office, Research and development, Microsoft Word, Word processor, Research

183

An ever-growing number of molecular phylogenetic studies are being published, due in part to the advent of new techniques that allow cheap and quick DNA sequencing. Hence, the demand for relational databases with which to manage and annotate the amassing DNA sequences, genes, voucher specimens and associated biological data is increasing. In addition, a user-friendly interface is necessary for easy integration and management of the data stored in the database back-end. Available databases allow management of a wide variety of biological data. However, most database systems are not specifically constructed with the aim of being an organizational tool for researchers working in phylogenetic inference. Here we report new software that facilitates easy management of voucher and sequence data, consisting of a relational database as back-end for a graphical user interface accessed via a web browser. The application, VoSeq, includes tools for creating molecular datasets of DNA or amino acid sequences ready to be used in commonly used phylogenetic software such as RAxML, TNT, MrBayes and PAUP, as well as for creating tables ready for publishing. It also has inbuilt BLAST capabilities against all DNA sequences stored in VoSeq as well as sequences in NCBI GenBank. By using mash-ups and calls to web services, VoSeq allows easy integration with public services such as Yahoo! Maps, Flickr, Encyclopedia of Life (EOL) and GBIF (by generating data-dumps that can be processed with GBIF’s Integrated Publishing Toolkit).

Concepts: Relational database, Relational model, Microsoft, Biology, Molecular biology, SQL, DNA, Database

58

The rapid expansion of direct-to-consumer wearable fitness products (eg, Flex 2, Fitbit) and research-grade sensors (eg, SenseCam, Microsoft Research; activPAL, PAL Technologies) coincides with new opportunities for biomedical and behavioral researchers. Underserved communities report among the highest rates of chronic disease and could benefit from mobile technologies designed to facilitate awareness of health behaviors. However, new and nuanced ethical issues are introduced with new technologies, which are challenging both institutional review boards (IRBs) and researchers alike. Given the potential benefits of such technologies, ethical and regulatory concerns must be carefully considered.

Concepts: Microsoft Research, Microsoft, Innovation, Bioethics, Science, Medicine

34

Teaching bioinformatics at universities is complicated by typical computer classroom settings. As well as running software locally and online, students should gain experience of systems administration. For a future career in biology or bioinformatics, the installation of software is a useful skill. We propose that this may be taught by running the course on GNU/Linux running on inexpensive Raspberry Pi computer hardware, for which students may be granted full administrator access.

Concepts: Microsoft, Course, Higher education, History of education, Student, School, Teacher, Education

29

Interactive modules for Data Exploration and Visualization (imDEV) is a Microsoft Excel spreadsheet embedded application providing an integrated environment for the analysis of omics data through a user-friendly interface. Individual modules enable interactive and dynamic analyses of large data by interfacing R’s multivariate statistics and highly customizable visualizations with the spreadsheet environment, aiding robust inferences and generating information-rich data visualizations. This tool provides access to multiple comparisons with false discovery correction, hierarchical clustering, principal and independent component analyses, partial least squares regression and discriminant analysis, through an intuitive interface for creating high-quality two- and three-dimensional visualizations including scatter plot matrices, distribution plots, dendrograms, heat maps, biplots, trellis biplots and correlation networks. Availability and implementation: Freely available for download at http://sourceforge.net/projects/imdev/. Implemented in R and VBA and supported by Microsoft Excel (2003, 2007 and 2010).

Concepts: Regression analysis, Visual Basic for Applications, Lotus 1-2-3, Graphical user interface, Microsoft Office, Spreadsheet, Microsoft, Microsoft Excel

28

Enzyme kinetic parameters are usually determined from initial rates; nevertheless, laboratory instruments only measure substrate or product concentration versus reaction time (progress curves). To overcome this problem we present a methodology which uses integrated models based on the Michaelis-Menten equation. The most severe practical limitation of progress curve analysis occurs when the enzyme shows a loss of activity under the chosen assay conditions. To avoid this problem it is possible to work with the same experimental points utilized for initial rates determination. This methodology is illustrated by the use of integrated kinetic equations with the well-known reaction catalyzed by the enzyme alkaline phosphatase. In this work nonlinear regression was performed with the Solver add-in (Microsoft Office Excel). Solver is easy to work with, and the convergence of the SSE (sum of squared errors) can be tracked graphically. The diagnosis of enzyme inhibition was performed according to the Akaike information criterion.
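As a rough illustration of the quantities mentioned above, the following Python sketch evaluates a progress curve from the integrated Michaelis-Menten equation, Vmax·t = (S0 − S) + Km·ln(S0/S), and computes the SSE and a least-squares form of the Akaike information criterion. This is not the authors' spreadsheet: the function names, the bisection solver, and the AIC form n·ln(SSE/n) + 2k are assumptions chosen for the example, and the model shown is the simple uninhibited case only.

```python
import math


def substrate_at(t, s0, vmax, km):
    """Solve Vmax*t = (S0 - S) + Km*ln(S0/S) for S by bisection."""
    if t <= 0:
        return s0
    lo, hi = 1e-12 * s0, s0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        elapsed = ((s0 - mid) + km * math.log(s0 / mid)) / vmax
        if elapsed > t:
            lo = mid  # mid is reached later than t, so S(t) lies above mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def sse(times, observed, s0, vmax, km):
    """Sum of squared errors between observed and modelled substrate levels."""
    return sum(
        (obs - substrate_at(t, s0, vmax, km)) ** 2
        for t, obs in zip(times, observed)
    )


def aic(sse_value, n_points, n_params):
    """Least-squares AIC (assumed form); requires sse_value > 0."""
    return n_points * math.log(sse_value / n_points) + 2 * n_params
```

An optimizer (Solver in the source) would adjust `vmax` and `km` to minimize `sse`; competing models could then be ranked by `aic`, with the lower value preferred.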

Concepts: Microsoft Office 2008 for Mac, Microsoft Office 2007, Spreadsheet, Enzyme kinetics, Microsoft Office, Microsoft Excel, Microsoft, Enzyme

27

DIYABC is a software package for a comprehensive analysis of population history using approximate Bayesian computation (ABC) on DNA polymorphism data. Version 2.0 implements a number of new features and analytical methods. It allows: (i) the analysis of single nucleotide polymorphism (SNP) data at a large number of loci, apart from microsatellite and DNA sequence data; (ii) efficient Bayesian model choice using linear discriminant analysis on summary statistics; and (iii) the serial launching of multiple post-processing analyses. DIYABC v2.0 also includes a user-friendly graphical interface with various new options. It can be run on three operating systems: GNU/Linux, Microsoft Windows and Apple OS X.

Concepts: Mathematical analysis, Linux, Apple Inc., Graphical user interface, Microsoft, Mac OS X, Microsoft Windows, Operating system

26

Excel2Genie, a simple and user-friendly Microsoft Excel interface, has been developed for the Genie-2000 Spectroscopic Software of Canberra Industries. This Excel application can directly control the Canberra Multichannel Analyzer (MCA), process the acquired data and visualize them. Combining Genie-2000 with Excel2Genie results in remarkably increased flexibility and makes it possible to carry out repetitive data acquisitions, even with changing parameters, and more sophisticated analysis. The developed software package comprises three worksheets: display parameters and results of data acquisition, data analysis and mathematical operations carried out on the measured gamma spectra. At the same time it also allows control of these processes. Excel2Genie is freely available to assist interested Canberra users with gamma spectrum measurements and data evaluation. With access to the Visual Basic for Applications (VBA) source code of this application, users can modify the interface to suit their needs.

Concepts: Computer software, CP/M, Microsoft Excel, Microsoft Office, Microsoft, Computer program, Spreadsheet, Visual Basic for Applications

26

A hydrochemical facies evolution diagram (HFE-D) is a multirectangular diagram, which is a useful tool in the interpretation of sea water intrusion processes. This method note describes a simple method for generating an HFE-D plot using the spreadsheet software package, Microsoft Excel. The code was applied to groundwater from the alluvial coastal plain of Grosseto (Tuscany, Italy), which is characterized by a complex salinization process in which sea water mixes with sulfate or bicarbonate recharge water.

Concepts: Visual Basic for Applications, Spreadsheet software, Water, Lotus 1-2-3, Spreadsheet, Microsoft Excel, Microsoft, Microsoft Office