博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
两个行业的故事:编程语言与富裕国家和发展中国家之间的差异
阅读量:4044 次
发布时间:2019-05-24

本文共 4299 字,大约阅读时间需要 14 分钟。

A Tale of Two Industries: How Programming Languages Differ Between Wealthy and Developing Countries

技术与人均国内生产总值相关

在最近的一篇文章中,我们看到Android的流量问题(占国家堆栈溢出访问的百分比)往往与一个国家的人均GDP负相关。 这可能会导致我们想知道任何其他标签是否也是如此。
当我们探索主要的编程语言和平台时,除了Android之外,其他一些表现出来包括PHP,Python和R.

Android和PHP流量的数量与国家的收入呈负相关,而Python和R则呈正相关。在每种情况下,我们可以看到例外(韩国使用的Android比我们预期的要多,而中国更多的是Python),但一般来说,相关性很强。 (每个具有约0.5-6的R2,对于多次测试进行调整后,p值<10-6)。

我们会强调,我们在这里并不表示任何因果关系。我们当然不是说编程语言选择影响到一个国家的平均收入,但是我们也不是说一个国家的财富直接影响到他们对技术的使用。我们怀疑司机可能会混合经济和社会因素(教育水平,软件行业的年龄,外包水平),这通常与一国的财富相关。
我们如何将软件开发行业分为两个?
当我们研究趋势时,谈论两组国家(高收入和非高收入)是有用的,而不是考虑一堆相关性。作为一个有用的预先存在的分类,我们可以使用世界银行的收入分类,这是根据人均国民总收入(国民总收入)(参见这里讨论这一分类)。

There are 78 , largely made up of the US and Canada, Western Europe, parts of the Middle East and East Asia, and Australia/New Zealand. I’ve done some analyses of the fundamental drivers of the between-country variation (such as ) that suggest this is a reasonable division, and that it’s more meaningful than other ways we could divide them, such as Eastern vs Western Hemisphere. (For instance, Australia is generally more similar to the US and Europe in terms of visited technologies than it is to China or Indonesia).

The division splits Stack Overflow traffic into groups of about two-thirds and one-third: 63.7% of Stack Overflow’s traffic comes from high income countries. (This likely is due to a combination of greater proportion of software development, more widespread internet access, and a disproportionate share of English-speakers). Much of the traffic from non-high-income countries comes from India, followed by Brazil, Russia, and China.

How do high-income countries differ in the technologies they use?

We’ve now divided the software development world into two segments. How do high-income and non-high-income countries differ in terms of the technologies they use?

We can extract several interesting insights:

  • Difference in data science technologies: As we saw earlier, Python and R are associated with a country’s income. Python is visited about twice as often in high-income countries as in the rest of the world, and R about three times as much. We might also notice that among the smaller tags, many of the greatest shifts are in scientific Python and R packages such as , ,  and . This suggests that part of the income gap in these two languages may be due to their role in science and academic research. It makes sense these would be more common in wealthier industrialized nations, where scientific research makes up a larger portion of the economy and programmers are more likely to have advanced degrees.

  • C/C++: C/C++ are two other notable languages that tend to be visited from high-income countries. One hypothesis is that this may have to do with education: as we saw in a , C and C++ are among the languages more disproportionately visited from American universities. It could also be related to the geographic distribution of the electronics and manufacturing industries.

  • PHP and Android: We explored Android development around the world in a , but PHP is another technology that’s notably associated with lower-income countries. It’s interesting to see that CodeIgniter, a PHP open source framework, is the tag that’s singularly most disproportionately visited from lower-income countries, by a large margin. Further examination shows it is especially heavily visited in South/Southeast Asia (particularly India, Indonesia, Pakistan and the Philippines) while it has very little traffic from the US and Europe. It’s possible that CodeIgniter is a common choice for outsourcing firms building websites.

Conclusion: why does this matter?

I was certainly interested in these results as a fun fact about the programming language ecosystem. But it also has implications for other data explorations we’ll be publishing in the near future.

When we ask questions about the software development industry, it’s important to know that we’re really answering two separate questions that have been “blended” together, and that separating them can sometimes give us more informative answers.

For example, we’re often interested in understanding which technologies drive the most traffic, such as . If we were to create a list of the most visited programming technologies, it would be different for high-income and low-income countries:

转载地址:http://qcrdi.baihongyu.com/

你可能感兴趣的文章
CentOS、Ubuntu、Debian三个linux比较异同
查看>>
javascript闭包和闭包的几种写法及用途
查看>>
Js作用域与作用域链详解
查看>>
nginx下 499错误
查看>>
网络性能测试工具iperf详细使用图文教程
查看>>
MacOSX上ab并发测试常见报错及解决办法
查看>>
为你的网站开启 gzip 压缩功能(nodejs、nginx)
查看>>
网页性能管理详解
查看>>
try catch 对代码运行的性能影响
查看>>
Koa框架实践与中间件原理剖析
查看>>
node.js 资料收集
查看>>
解除 Linux 系统的最大进程数和最大文件打开数限制
查看>>
怎样才是一个基本水平的java程序员?
查看>>
UGC,PGC,OGC
查看>>
一道关于Promise应用的面试题
查看>>
Couchbase 介绍 - 更好的 Cache 系统
查看>>
Memcached Redis Membase性能测试对比分析
查看>>
couchbase 与 redis的横向对比
查看>>
缓存的进化之路—Couchbase的分布式架构
查看>>
Chrome渲染分析之Timeline工具的使用
查看>>