programming polls

1. A big increase in SAS user participation in 2014, perhaps partly driven by growth and change in KDnuggets readers composition, and likely also by increased visibility of this poll among SAS users. SAS users had a high percentage of "lone" votes - in 2014, 58% of them said they used only SAS, compared to 26% in 2013. The fraction of "lone" votes in 2014 was 20.5% for R, 14% for Python, and only 4.5% for SQL. 

2. Consolidation among top 4 languages - R, SAS, Python, and SQL. 91% of all voters have used at least one of them. Almost all other languages declined in their popularity for data mining tasks, including Java, Unix shell, MATLAB, C/C++, Perl, Octave, Ruby, Lisp, and F#. 

Here is a Venn diagram that shows significant overlap between R, Python, and SQL. The percentages indicated how many voters chose that option, eg 20% of all voters have used both R and Python, while 10% have used R, Python, and SQL. The areas of the circles and intersections approximately correspond to the fraction of voters. 

Here is a similar Venn diagram showing overlap between R, Python, and SAS. We see that SAS is much more independent from R and Python, with about 2/3 of of SAS users not using R or Python. 

 

3. Languages with the highest growth in 2014 were

 

4. The languages with the largest decline in share of usage were

 

Here is the table with more details: 

What programming/statistics languages you used for an analytics / data mining / data science work in 2014?

Language used

R (352 voters in 2014)

SAS (262)

Python (252)

SQL (220)

Java (89)

Unix shell/awk/sed (63)

Pig Latin/ Hive/ other Hadoop-based languages (61)

SPSS (58)

MATLAB (45)

Scala (28)

C/C++ (26)

Julia (21)

Other low-level languages (20)

Perl (19)

GNU Octave (17)

Ruby (9)

Lisp/Clojure (5)

F# (0)

 % voters in 2014 (719 total) 

 % voters in 2013 (713 total) 

 % voters in 2012 (579 total)

 49.0%

 60.9%

 52.5%

 36.4%

 20.8%

 19.7%

 35.0%

 38.8%

 36.1%

 30.6%

 36.6%

 32.1%

 12.4%

 16.5%

 21.2%

 8.8%

 11.1%

 14.7%

 8.5%

 8.0%

 6.7%

 8.1%

not asked

not asked

 6.3%

 12.5%

 13.1%

 3.9%

 2.2%

 2.4%

 3.6%

 9.3%

 14.3%

 2.9%

 0.7%

 0.3%

 2.8%

 5.9%

 11.4%

 2.6%

 4.5%

 9.0%

 2.4%

 5.6%

 5.9%

 1.3%

 2.2%

 3.8%

 0.7%

 1.0%

 4.3%

0%

 1.7%

not asked in 2012

Among other programming languages William Dwinnell mentioned Compiled BASIC (PowerBASIC). 

Regional participation was