Posts

Changing forests, Changing climate and Changing economies

  One of the fascinating aspects of working with data in clean technology is how variable the data are over space and time. So, as scientists trying to understand how different systems interact with each other, it usually means that we’re building several models that work together so that both the spatial and temporal aspects are accounted for.     And that’s especially true in the forestry sector. Forests are incredibly important ecosystems - untouched forests in the Amazon, Indonesia, the Congo Basin and other areas sequester carbon, provide habitat for species that cannot be found elsewhere and have been found to be important controllers of weather patterns locally and regionally. Additionally, second growth forests and agro-forests supply timber, medicines and other products that contribute close to $583 billion dollars every year to the global economy.   Further, as countries around the globe work on combating climate change, REDD+ payments or payments to develo...

Communicating As A Data Scientist

Image
  Wow, this has been a crazy week here in the San Francisco Bay Area! If a pandemic wasn’t enough, we now have over 300 fires burning in the area as a result of an unusual summer thunderstorm accompanied by lightning strikes.     It’s one of the aspects of climate change - that weather becomes more extreme. So, the western US and Australia as well as other areas see less precipitation, or precipitation that is unusual in amounts and timing, warmer temperatures. Thus, drier, warmer conditions that are ideal for these kind of extreme events become more prevalent - and hence, more disasters.     As professionals working in clean technology, we often get tasked with building the models for these systems, understanding what’s happening on the ground and developing new technologies to help solve these problems.     The one thing that many of us don’t really explore is the whole aspect of communicating the science and what the data are telling us.   This...

When AI and Machine Learning come to the forests

  A big thank you to everyone who joined us last weekend for a lively and interesting discussion on data engineering and how to build prototypes that access satellite imagery using Google Earth Engine and Python.   It’s always fun to talk about satellites, imagery and how to get things to work in many different clean technology sectors - agriculture, water, energy, climate and disaster management among them.     Today, let’s talk about one sector that doesn’t get as much attention - forestry.   If you heard the the words forests and satellite imagery in one sentence, what comes to your mind? Deforestation? Reforestation? Wildfires? All three?   Managing our forests sustainably is key to protecting the environment in so many different ways - forests have a huge impact on climate, on ecosystem services and on the livelihoods of communities that rely on them. However, the challenge is that most forests are hard to access and data is often difficult to verify o...

When Satellite Data Improves - What Happens in Clean Technology?

  In June this year,   we had a lively discussion and online workshop on remote sensing data   and how monitoring processes occurring on the Earth was why the Landsat satellite program was launched in the 1970s - a program that’s still running today.     But here’s an interesting question that came up in our conversation - since water, agriculture, energy and other clean tech sectors have been using remote sensing data for such a long time - what is so different now?     To answer that question, let’s first talk about how satellite data is used in clean technology. The sectors where satellite data, and data science in general, are widely used both commercially and in research and development are agriculture, energy, water, climate and disaster management.   So, what are the different uses of satellite data in each of these sectors?   Let’s take agriculture first.   Researchers and scientists have been using satellite data since the 1970s...

A Trillion Dollar Market - But Where Are the People?

  Last week we talked about how the market in clean technology and data science   is already in the multi-billion dollar range and is headed to the multi-trillion dollar space in the next decade or so. However, one of the challenges that analysts highlighted was the lack of professionals who have sufficient expertise in both clean technology and data science. So today, let’s take a look at what’s happening in educating professionals in this exciting, new field as well as the kind of skills that are needed.   Most of the traditional college and university programs haven’t yet caught up with the demand for professionals at this intersection of specialities - although they are getting there! While many universities and colleges have created data science degrees - these usually focus on the problems that are faced by the high-tech and internet sectors. The graduates from these programs usually have a pretty solid understanding of coding, algorithms including machine learning,...

Data Science and Clean Technology: Updated Market Analysis

Image
  If you were to ask people why they’re interested in applying data science in clean technology - the chances are that you’ll come across three answers. 1) They want to make a difference to the planet and help people 2) They think it’s cool technology and want to be at the forefront of innovation and 3) They’ve heard it’s a hot and upcoming field with lots of jobs and opportunities and want to get in at the ground level.   Now, one and two are both pretty obvious - but what about the third reason? Is the intersection of clean technology and data science really such a growing field?     To answer that question, let’s take a look at some numbers. Now, about five years ago, when the field was in its infancy, there was a lot of speculation about the field being anywhere from a   multi-billion dollar market to a multi-trillion dollar market . How did those estimates hold up, now as we look at what the next five years may bring?   As it turns out, the estimates h...

Not just another machine learning algorithm - how solving clean technology applications forces adaptations in basic machine learning techniques.

  One of the first questions I get about workshops on this topic is - why do we need to talk about machine learning again? There are tons of online courses available already, lots of free material on the web and libraries in Python that are easy enough to get started with. So, why look at this stuff once more? Why not just point us to the best existing resources and let us get on with it?   And the answer is - yes, there are lots of excellent resources (free and paid) on machine learning and yes, we’ll have a list of those resources available for additional reference. But, and this is a big but - many of these resources are targeted to problems faced in the high-tech sector, where the data and types of problems are very different. When we’re solving problems in clean technology, the kind of data we have and the questions we’re faced with are often quite different. That means that machine learning algorithms have to be adapted to work in our sector - and the way they get adapte...

Go wide or go deep? Data and models in clean technology

  When faced with a question about agriculture, water, energy, air or another clean technology system, how do you decide to model it? Do you dive down deep into the subject matter and try and figure out what would work? Or do you go wide, look at the interactions between different systems and see how different types of data and models can be combined? Or do you do a mix of both?     Like many other challenges in the data science and clean technology fields, it really depends on the question you’re trying to answer - and the data that are available. If the question you’re trying to answer has a relatively well defined process and sufficient data - then it makes sense to start by diving deep into the subject and looking at different processes and interactions within a relatively narrow field. For example, let’s say that we’re trying to understand the chemical interactions in a water treatment plant process and if there’s a problem with how effectively the treatment process ...