Tag Archives: data science

March 2018 in Review

Most frightening stories:

Most hopeful stories:

  • One large sprawling city could be roughly the economic equivalent of several small high-density cities. This could potentially be good news for the planet if you choose in favor of the latter, and preserve the spaces in between as some combination of natural land and farm land.
  • The problems with free parking, and solutions to the problems, are well known. This could potentially be good news if anything were to be actually done about it. Self-parking cars could be really fantastic for cities.
  • The coal industry continues to collapse, and even the other fossil fuels are saying they are a bunch of whining losers. And yes, I consider this positive. I hope there aren’t too many old ladies whose pensions depend on coal at this point.

Most interesting stories, that were not particularly frightening or hopeful, or perhaps were a mixture of both:

2017 in Review

Most frightening stories of 2017:

  • January: The U.S. government may be “planning to roll back or dilute many of the provisions of Dodd-Frank, particularly those that protect consumers from toxic financial products and those that impose restrictions on banks”.
  • February: The Doomsday Clock was moved to 2.5 minutes to midnight. The worst it has ever been was 2 minutes to midnight in the early 1980s. In related news, the idea of a U.S.-China war is looking a bit more plausible. The U.S. military may be considering sending ground troops to Syria.
  • MarchLa Paz, Bolivia, is in a serious crisis caused by loss of its glacier-fed water supply. At the same time we are losing glaciers and snowpack in important food-growing regions, the global groundwater situation is also looking bleak. And for those of us trying to do our little part for water conservation, investing in a residential graywater system can take around 15 years to break even at current costs and water rates.
  • April: The U.S. health care market is screwed up seemingly beyond repair. Why can’t we have nice things? Oh right, because our politicians represent big business, not voters. Also, we have forgotten the difference between a dialog and an argument.
  • May: We hit 410 ppm at Mauna Loa.
  • JuneThe Onion shared this uncharacteristically unfunny observation: “MYTH: There is nothing mankind can do to prevent climate change. FACT: There is nothing mankind will do to prevent climate change”. It’s not funny because it’s probably true.
  • July: Long term food security in Asia could be a problem.
  • August: The U.S. construction industry has had negligible productivity gains in the past 40 years.
  • September: During the Vietnam War the United States dropped approximately twice as many tons of bombs in Southeast Asia as the Allied forces combined used against both Germany and Japan in World War II. After the Cold War finally ended, Mikhail Gorbachev made some good suggestions for how to achieve a lasting peace. They were ignored. We may be witnessing the decline of the American Empire as a result.
  • October: It is possible that a catastrophic loss of insects is occurring and that it may lead to ecological collapse. Also, there is new evidence that pollution is harming human health and even the global economy more than previously thought.
  • November: I thought about war and peace in November. Well, mostly war. War is frightening. The United States of America appears to be flailing about militarily all over the world guided by no foreign policy. Big wars of the past have sometimes been started by overconfident leaders thinking they could get a quick military victory, only to find themselves bogged down in something much larger and more intractable than they imagined. But enemies are good to have – the Nazis understood that a scared population will believe what you tell them.
  • December: A lot of people would probably agree that the United States government is becoming increasingly dysfunctional, but I don’t think many would question the long-term stability of our form of government itself. Maybe we should start to do that. The Consumer Financial Protection Bureau has been doing a decent job of protecting consumers and reducing the risk of another financial crisis. The person in charge of it now was put there specifically to ruin it. Something similar may be about to happen at the Census Bureau. A U.S. Constitutional Convention is actually a possibility, and might threaten the stability of the nation.

Most hopeful stories of 2017:

Most interesting stories that weren’t particularly frightening or hopeful, or perhaps were a mixture of both:

  • January: Apple, Google, and Facebook may destroy the telecom industry.
  • February: The idea of growing human organs inside a pig, or even a viable human-pig hybrid, is getting very closeTiny brains can also be grown on a microchip. Bringing back extinct animals is also getting very close.
  • March: Bill Gates has proposed a “robot tax”. The basic idea is that if and when automation starts to increase productivity, you could tax the increase in profits and use the money to help any workers displaced by the automation. In related somewhat boring economic news, there are a variety of theories as to why a raise in the minimum wage does not appear to cause unemployment as classical economic theory would predict.
  • April: I finished reading Rainbow’s End, a fantastic Vernor Vinge novel about augmented reality in the near future, among other things.
  • May: The sex robots are here.
  • June: “Fleur de lawn” is a mix of perennial rye, hard fescue, micro clover, yarrow, Achillea millefolium, sweet alyssum, Lobularia maritima, baby blue eyes, Nemophila menziesi, English daisy, Bellis perennis, and O’Connor’s strawberry clover, Trifolium fragiferum.
  • July: Ecologists have some new ideas for measuring resilience of ecosystems. Technologists have some wild ideas to have robots directly counteract the effects of humans on ecosystems. I like ideas – how do I get a (well-compensated) job where I can just sit around and think up ideas?
  • August: Elon Musk has thrown his energy into deep tunneling technology.
  • September: I learned that the OECD Science, Technology and Innovation Outlook named “ten key emerging technology trends”: The Internet of Things, Big data analytics, Artificial intelligence, Neurotechnologies, Nano/microsatellites, Nanomaterials, Additive manufacturing / 3D printing, Advanced energy storage technologies, Synthetic biology, Blockchain
  • October: Even if autonomous trucks are not ready for tricky urban situations, they could be autonomous on the highway with a small number of remote-control drivers guiding a large number of tricks through tricky urban maneuvers, not unlike the way ports or trainyards are run now. There is also new thinking on how to transition highways gradually through a mix of human and computer-controlled vehicles, and eventually to full computer control. New research shows that even a small number of autonomous vehicles mixed in with human drivers will be safer for everyone. While some reports predict autonomous taxis will be available in the 2020s, Google says that number is more like 2017.
  • November: It’s possible that the kind of ideal planned economy envisioned by early Soviet economists (which never came to pass) could be realized with the computing power and algorithms just beginning to be available now.
  • DecemberMicrosoft is trying to one-up Google Scholar, which is good for researchers. More computing firepower is being focused on making sense of all the scientific papers out there.

I’ll keep this on the short side. Here are a few trends I see:

Risk of War. I think I said about a year ago that if we could through the next four years without a world war or nuclear detonation, we will be doing well. Well, one year down and three to go. That’s the bright side. The dark side is that it is time to acknowledge there is a regional war going on in the Middle East. It could escalate, it could go nuclear, and it could result in military confrontation between the United States and Russia. Likewise, the situation in North Korea could turn into a regional conflict, could go nuclear, and could lead to military confrontation between the United States and China.

Decline…and Fall? A question on my mind is whether the United States is a nation in decline, and I think the surprisingly obvious answer is yes. The more important question is whether it is a temporary dip, or the beginning of a decline and fall.

Risk of Financial Crisis. The risk of another serious financial crisis is even scarier that war in some ways, at least a limited, non-nuclear war. Surprisingly, the economic effects can be more severe, more widespread and longer lasting. We are seeing the continued weakening of regulations attempting to limit systemic risk-taking for short-term gain. Without a pickup in long-term productivity growth and with the demographic and ecological headwinds that we face, another crisis equal to or worse than the 2007 one could be the one that we don’t recover from.

Ecological Collapse? The story about vanishing insects was eye-opening to me. Could global ecosystems go into a freefall? Could populous regions of the world face a catastrophic food shortage? It is hard to imagine these things coming to a head in the near term, but the world needs to take these risks seriously since the consequences would be so great.

Technology. With everything else going on, technology just marches forward, of course. One technology I find particularly interesting is new approaches to research that mine and attempt to synthesize large bodies of scientific research.

Can the human species implement good ideas? Solutions exist. I would love to end on a positive note, but at the moment I find myself questioning whether our particular species of hairless ape can implement them.

But – how’s this for ending on a positive note – like I said at the beginning, the one thing about 2017 that definitely didn’t suck was that we didn’t get blown up!

alternatives to word clouds

I like this post on R bloggers proposing several alternatives to word clouds. I’ll list them below but really, you should look at the pictures because hey, this is about pictures.

  1. circle packing (basically this replaces the words with circles, dealing with the problem of bigger/longer words appearing to be more important in standard word clouds); there is a variation on this called the “horn of plenty” where the circles are arranged in order rather than randomly
  2. cartogram (in my ignorance, I have been calling this a “bubble map”. I have used these frequently to show engineering model results and find they work well for many people)
  3. chloropleth (these shade in geographic areas to convey data. I find these work well if the size of the geographic area is important information. If it is not, these tend to draw the viewer’s eye to larger areas, and in that case the bubbles are better. For example, per-person income of Luxembourg vs. China.)
  4. treemap (I’ve been calling these “packed rectangles” and I generally find them good for anything where conveying relative magnitudes of things to people is important)
  5. donuts (surpringly, the author concludes a donut is the best option for the data he is trying to show and I kind of agree, it gets the point across and leaves lots of room for labels)

The article has links to the specific packages and code used to create the graphics.

August 2017 in Review

Most frightening stories:

  • Around 200,000 people may be dying prematurely in the U.S. each year due to air pollution. Meanwhile, the Trump administration may be trying to censor the National Climate Assessment, which presents the consensus among serious scientists in the United States government that climate change is very real and a very real threat to our country.
  • The U.S. may already be in the middle of a soft military coup. We have a batshit-crazy President playing nuclear chicken with all our lives. And with the legislative branch not even trying to do anything about this, we are actually hoping the generals who are running our country will be the coolest heads in the room when it comes to preventing nuclear war. North Korea may be closer to submarine-launched nuclear weapons than previously thought. Meanwhile, there are three ways for terrorists or other non-state actors to get their hands on nuclear weapons: “transfer—the sale or handoff of a weapon from a nuclear-weapon state; leakage—the theft of a nuclear weapon or weapons-grade fissile material; and indigenous production—the construction of a nuclear device from illicitly obtained weapons-grade fissile material.” And the U.S. and Russia are no longer cooperating on non-proliferation.
  • The U.S. construction industry has had negligible productivity gains in the past 40 years.

Most hopeful stories:

  • The United Nations General Assembly adopted a resolution (in July) that could eventually, maybe lead to the total elimination of nuclear weapons on Earth.
  • The Aichi Biodiversity Targets are some very specific numerical targets that have incorporated in the 2015 Sustainable Goals.
  • Great Transitions are ideas for how the world could transition to a sustainable state without going through a major setback along the way.

Most interesting stories, that were not particularly frightening or hopeful, or perhaps were a mixture of both:

  • Elon Musk has thrown his energy into deep tunneling technology.
  • When you sow seeds, it makes sense to sow the ones that have the most trouble establishing at the highest density.
  • You can use R to recreate the famous plot of Napoleon’s invasion of Russia.

As I am writing these words on Labor Day, the news is about a North Korean nuclear test. In a strange coincidence, I happened to see both the USS New Jersey, which was involved in the Korean War, and the Korean War Memorial here in Philadelphia yesterday while waiting for a bus (which never came). That war caused a lot of pain and suffering on all sides. It would be a tragedy to let it flair up again, and an even bigger tragedy if nuclear weapons were to be involved.

eyes on the street

A group at the University of Pennsylvania looked for statistical evidence that “eyes on the street” are a deterrent to crime. The results are a bit puzzling, as real world data often can be.

ANALYSIS OF URBAN VIBRANCY AND SAFETY IN PHILADELPHIA

Statistical analyses of urban environments have been recently improved through publicly available high resolution data and mapping technologies that have adopted across industries. These technologies allow us to create metrics to empirically investigate urban design principles of the past half-century. Philadelphia is an interesting case study for this work, with its rapid urban development and population increase in the last decade. We focus on features of what urban planners call vibrancy: measures of positive, healthy activity or energy in an area. Historically, vibrancy has been very challenging to measure empirically. We explore the association between safety (violent and non-violent crime) and features of local neighborhood vibrancy such as population, economic measures and land use zoning. Despite rhetoric about the negative effects of population density in the 1960s and 70s, we find very little association between crime and population density. Measures based on land use zoning are not an adequate description of local vibrancy and so we construct a database and set of measures of business activity in each neighborhood. We employ several matching analyses within census block groups to explore the relationship between neighborhood vibrancy and safety at a higher resolution. We fi nd that neighborhoods with more vacancy have higher crime but within neighborhoods, crimes tend not to be located near vacant properties. We also find that more crimes occur near business locations but businesses that are active (open) for longer periods are associated with fewer crimes.

This is particularly fascinating to me because I live my life in the middle of this particular data set and am part of it. So it is very interesting to compare what the data seem to be saying with my own experiences and impressions.

The lack of correlation between population density and crime is not surprising. Two neighborhoods with identical density can be drastically different. The correlation between poverty and crime is not surprising – people who are not succeeding in the formal economy and who are not mobile turn to the informal economy, in other words drug dealing, loan sharking and other illegal ways of trying to earn an income. If they are successful at earning an income, they tend to have a lot of cash around, and other people who know about the cash will take advantage of them, knowing they will not go to the police. Other than going to the police, the remaining options are to be taken advantage of repeatedly, or to retaliate. This is how violence escalates, I believe, and it goes hand in hand with development of a culture that tolerates and even celebrates violence, in a never-ending feedback loop.

The puzzling part comes when they try to drill down and look at explanatory factors at a very fine spatial scale. They found a correlation between crime and mixed use zoning, which appears to contradict the idea that eyes on the street around the clock will help to deter crime. And they found more crime around businesses like cafes, restaurants, bars and retail shops. They found that longer open hours seemed to have some deterrent effect on crime relative to shorter open hours.

I think they have made an excellent effort to do this, and I am not sure it can be done a lot better, but I will point out one idea I have. They talk about some limitations and nuances of their data, but one they do not mention is the idea that they are looking at reported crimes, most likely police reports or 911 calls. It could be that business owners, staff and patrons are much more likely to call 911 and report a crime than are residential neighbors. The business staff and patrons may see this as being in the economic interest, increasing the safety of their families, and the (alleged) criminals they are reporting are generally strangers. In quieter all-residential neighborhoods, people may not observe as many of the crimes that do occur (fewer “eyes on the street”), they may prefer not to report crimes either through a sense of loyalty to one’s neighbors, minding one’s own business, quid pro quo, or in some cases a fear of retaliation. There is also the factor of some demographic groups trusting the police more than others, although the authors’ statistical attempts to control for demographics may tend to factor this out.