Blogs

  • R vs Python

    R or Python? A Head To Head Infographic from DataCamp

    Saturday, May 30, 2015 - 09:30
    If you're a data scientist, you almost certainly use one or both of the data analysis languages, Python and R. Both are freely-available open source, and both have large and active world-wide communities that extend the core platforms. Unlike other data science infographics, this one from DataCamp absolutely nails the strengths and weaknesses of the competitors. Not surprisingly, Inquidia data scientists are divided between allegiance to Python and R. No doubt, though, both Python and... [more]
  • Steve Miller

    Ruby and R

    Thursday, December 18, 2014 - 08:00
    Like the scripting languages tradition of Perl and Python, Ruby's ideal for many core munging challenges of data science. The combination of Ruby data types with methods, blocks and iterators, makes for very powerful code. But does the rest of Ruby fit the bill? [more]
  • sqoop logo

    Lessons Learned With Sqoop

    Wednesday, November 19, 2014 - 08:00
    Chris Deptula shares some tips and tricks to make Sqoop hum. Take a look at his lessons learned. [more]
  • Steve Miller

    The MIT Analytics Imperative

    Thursday, November 13, 2014 - 14:00
    Steve discusses why some firms are successful with analytics and why some are not. Take a look at this review of The Analytics Mandate. [more]
  • Hadoop File Formats: It's not just CSV anymore

    Wednesday, November 12, 2014 - 09:45
    There are stark performance differences resulting from (im)proper format choices in Hadoop. Unfortunately, there is no single file format that optimizes for all of these concerns. You'll need to understand the trade-offs. [more]
  • avro

    Updated Avro Plugin for PDI Version 2.1.0

    Friday, October 31, 2014 - 08:45
    The Avro Output Plugin for Pentaho Data Integration allows you to output Avro files using Kettle. Avro files are commonly used in Hadoop allowing for schema evolution and truly separating the write schema from the read schema. System Requirements -Pentaho Data Integration 5.0 or above Installation Using Pentaho Marketplace In the Pentaho Marketplace find the Avro Output plugin and click Install Restart Spoon Manual Install Place the AvroOutputPlugin folder in the ${DI_HOME}/plugins/steps directory Restart Spoon... [more]
  • Steve Miller

    Defining Data Scientists & Their Tools

    Thursday, October 30, 2014 - 11:00
    My thoughts of the day involve reactions to two blog entries. The first is titled, “Data Scientists Must Also Be Research Methodology Scientists." The second is "SAS vs. R (vs. Python) – which tool should I learn?" Here's my take on both. The first, cited by Alex Liu from Research Methods and Data Science, references a blog posted by Informatics Professor Bill Hersh of the Oregon Health and Science University. While Hersh is a big... [more]
  • Steve Miller

    Pentaho, Cloudera Executives See Bigger Data Opportunities

    Wednesday, October 22, 2014 - 09:00
    I had the opportunity to spend an hour on a call with Pentaho CEO Quentin Gallivan and Cloudera Chief Strategy Officer Mike Olson immediately following their first day keynotes at Pentaho World 2014. [more]
  • pentaho analyzer image

    Parent-Child Hierarchies with Abnormal Genealogy

    Wednesday, October 15, 2014 - 13:45
    I’m a huge user of Mondrian, the OLAP engine behind Pentaho and Saiku, but there are times when it doesn’t do what I need it to, or exactly what I expect it to. Take this special case I call “Abnormal Genealogy.” [more]
  • Steve Miller

    data.table University

    Tuesday, October 14, 2014 - 13:45
    I've been spending quite a bit of time lately working with the data.table package in R. [more]

Pages

Contact us today to find out how Inquidia can show you how to collect, integrate and enrich your data. We do data. You can, too.

Would you like to know more?

Sign up for our fascinating (albeit infrequent) emails. Get the latest news, tips, tricks and other cool info from Inquidia.