Thursday, September 13, 2012

The Troubled Future of Higher Education

The political scientists Gary King and Maya Sen have just posted an excellent working paper clearly outlining the major problems facing higher education: economic, political, sociological. The main thrust is that, although only 30% of the American population obtains a four-year college degree (thus leaving an untapped 70% who could finish college degrees), the higher education system is facing major constraints due to limited budgets and major technological advances. For example, online sites such as Khan Academy are effectively competing with universities, and for-profit universities are growing at a high rate. I'd add to their list the potential for big data analysis to displace the role of experts; I refer to the effect of sabermetrics on baseball journalists or data mining algorithms on marketers as possible canaries in the cage for academics. Regardless, King and Sen's paper is a must-needed beginning of a discussion about the future of higher education in the wake of profound social changes. After all, it was only a mere decade ago that Time and Newsweek were major cultural institutions in American life.

Thursday, May 31, 2012

Using Indirect Survey Techniques to Measure Zombie Outbreaks?

Zombies are now a common topic of discussion. In fact, the data we have available from Google Trends (for the phrase "zombie attack") strongly suggest an increasing risk of zombification across the world:


 However, academic research on zombies is limited (i.e,. non-existent), mainly because of the lack of high quality data. For those interested in studying zombies, I refer readers to Andrew Gelman's paper (co-written, apparently, by the great zombie film director George Romero) on how to measure zombie outbreaks via indirect survey techniques. You can find his article here. Even if you're not interested in zombies, his paper offers some good ideas on how to sample difficult-to-reach populations more generally.

Friday, May 11, 2012

The Promising Future of Mathematical Sociology

I'm now an occasional blogger at Permutations, the official blog of the Mathematical Sociology Section of the American Sociological Association. You can read my blog post here, in which I outline why I think global trends in information technology and the meta-theroetical foundations of sociology provide conditions for a promising future for sociology in general and mathematical sociology in particular.

Thursday, May 10, 2012

90+ Two-Minute Videos on R

I highly recommend Anthony Damico's excellent two-minute videos on programming in R. You can find the full list of 90+ videos here. This is the first of the series, which tells you how to download and install R:

video

More generally, Anthony's video collection is another reminder of the immense sociological benefits that come from sharing educational materials and expert knowledge in the style of the Khan Academy.

Tuesday, May 08, 2012

Global Online Conference on Statistics

The Consortium for the Advancement of Undergraduate Statistics Education is hosting a global online conference titled "eCOTS: Electronic Conference on Teaching Statistics." You can view the full program here. It only costs $15 to register and participate in the online conference. For at least the past five years I've thought that conferences are obsolete in many respects, so I'm delighted to see this conference developed. By not having a physical place, with food, beverages, and equipment, not to mention lodging and transportation costs, the costs of attendance are much lower, thus enabling more and more people to learn and contribute to knowledge production. (Of course, we'll still want some conferences for face-to-face socialization!)

Sunday, May 06, 2012

I've Converted to R Full-Time

It's been over four years that I've been using both R and Stata, but as of last week I've become an R convert. For several years I had conducted statistical analyses in R (since many complex models can only be programmed in R), but I used Stata before and after the analyses. In essence I'd merge and clean data sets in Stata, call R from Stata for the statistical analyses, export R objects into Stata, and then use Stata's graphics utilities to display the results. This setup quickly unraveled last month when I began merging and recoding data in R, which  is much aided by John Fox's fantastic "car" package.

The problem is that if you want to do Bayesian analysis or graph modeled coefficients (or work with complex data structures more generally), then R is much easier than Stata due to the object-oriented programming environment. It's unbelievably liberating to be able to save vectors, matrices, data frames, and so on from multiple data sources and manipulations in the same conceptual space. Additionally, R has fantastic graphics capabilities (3-D plots, rotating hyperplanes, social network graphs, and so on), offers excellent tools for analyzing and displaying so-called big data (for example, check out the "tabplot" command from Google), and is (frankly) a fun, intuitive programming language. If you need additional reasons to be an R convert, keep in mind that R is completely free, open-source, and extensible, with over 5,300 statistical packages (as of April 2012).

Friday, May 04, 2012

Complex Sociotechnical Systems

In a fascinating, informative talk, the interim director of the Engineering Systems Division at MIT makes the case for a new field of study on complex sociotechnical systems. I ask a question near the end of the video, pointing out that the core concepts of the proposed new field are in fact those endemic to sociology: mixed methods, open systems, social change, and so forth. You can watch the full video here.

Saturday, April 21, 2012

Feynman on Curiosity

This video is one of the effective advertisements I've seen for the value of gathering and systematizing empirical knowledge, by none other than the late Richard Feynman:


Also, since you are probably wondering: the music is Primavera by Ludovico Einaudi.

Tuesday, April 17, 2012

The Future of the Academy in 2032

Just before he died, for a few years I helped the great sociologist Dan Bell with using his computer, and as a result I got to know him very well. One thing I learned from him (besides the distinction between "criticism" and "critique") is the usefulness of prediction as an endeavor in itself (as opposed to explanation). In this spirit, I offer five predictions about the future of the academy in 2032:
  1. First, despite opposition from many established institutions, there will be an enormous increase in open-source education. Classes on any topic will be available online for free, with lecture notes, videos, presentations, and chat services (with other students) available to anyone with a computer. Exemplars of this trend include MIT OpenCourseWare, Khan Academy, and videolectures.net
  2. Second, academic publishing will be increasingly online, with peer review a continuous process. Rather than books and articles published at one time in paper form after a process of peer review, academic projects will be ongoing, process-oriented, available online, and subjected to a continual process of peer review. In essence, everything that academics produce will be works-in-progress, and updated when errors are noted. Early indications of this trend include the NBER archive and  arxiv.org.
  3. Third,due to technological changes and increased monitoring of people's activity, academics will have to be adept with managing and analyzing big data. Common statistical methods will often be difficult to use on such large data sets, straining the computational capacities of computers. While not common in the academy yet, big data is one of the top buzzwords of 2012, and I expect this to spread to academic work relatively soon. An exemplar of this kind of academic work is the Google ngrams project. (One danger, however, is that private corporations might be hostile to information-sharing, and the values of profit-making may severely inhibit the availability of big data to academics.)
  4. Fourth, big ideas will actually be in greater demand in the future. Precisely because there will increasingly be an excess of information, grand theories and master narratives will be increasingly desired to help guide attention, avoid fragmentation of different research traditions, and unify otherwise disparate theories. For example, Josh Tenenbaum's efforts at unifying artificial intelligence (which suffers from disciplinary fragmentation) with probabilistic graphical models is a promising endeavor.
  5. Finally, the skills in demand will be increasingly modular rather than topical. For example, as part of the Cold War in the 1960s, the United States government funded various "area studies" programs to educate Americans on the traditions, customs, and practices of various geographic regions around the world. In the future, there will be less emphasis on this kind of topical knowledge, and greater emphasis on modular skills  such as critical analysis of any kind of texts or arguments, understanding the basic structures of any set of languages, and gathering and analyzing various kinds of qualitative and quantitative data. 
To the extent any of these predictions are correct, sociology is particularly well-suited to take advantage of these trends. Sociologists are generally supportive of the democratic, inclusive principles of open-source education and online publishing, and sociology has an unparalleled tradition of big ideas. Moreover, modularity is ingrained in the discipline; in fact, sociology is almost by definition a modular discipline, inasmuch sociology is an approach to a particular subject matter rather a particular subject matter per se.

Wednesday, April 11, 2012

The Quantified Self

This site on the quantified self shows a small but growing revolution: using quantitative data for self-improvement. I can only expect this to grow in importance. Despite their popularity, means, modes, medians (in their conditional variants as well) simply capture central tendencies, and that there is nearly always substantial heterogeneity within and across populations. Accordingly, basic proscriptions and prescriptions, such as "Take an aspirin a day" may not apply to all individuals, and thus individual tracking is potentially extremely useful. For example, see Seth Robert's blog post on how eating butter might improve cognitive functioning (for him, at the very least).