Joho the Blog » What data would ChatGPT add to train a weather model

What data would ChatGPT add to train a weather model

My prompt

If you were going to train a model to predict rain showers, what data might you use that is currently not being widely used?

ChatGPT 4o

Many advanced weather models incorporate a variety of data sources, but some of the more innovative and less traditional data types mentioned are not commonly used in current weather prediction models. Here are some of the data types from the previous list that are typically underutilized or not used at all:

Environmental and Geographical Data

  1. Soil Moisture Levels:
    • While some models use soil moisture data, it is not as widely incorporated or detailed as it could be.
  2. Vegetation and Land Cover Data:
    • Detailed vegetation and land cover data are often not integrated into weather prediction models at the level of granularity that might be beneficial.
  3. Surface Roughness:
    • Data on surface roughness is generally underutilized in most weather prediction models.

Atmospheric Data

  1. Aerosol Concentrations:
    • Detailed aerosol concentration data, particularly on a fine scale, is not commonly used in many weather prediction models.
  2. Atmospheric Composition:
    • Comprehensive data on various gases beyond the primary greenhouse gases (like ozone and carbon dioxide) are generally not integrated into weather prediction models.
  3. Cloud Microphysics:
    • Detailed cloud microphysics data, such as cloud droplet size distributions and ice crystal concentrations, are typically underutilized.

Human and Technological Data

  1. Pollution Levels:
    • Pollution data, including detailed air quality indices, are not commonly used in weather prediction models.
  2. Traffic and Transportation Data:
    • Vehicle emissions and heat data from traffic and transportation are rarely used in current models.

Temporal and Historical Data

  1. Real-time Social Media and Crowd-sourced Data:
    • While there are some experimental uses, real-time social media and crowd-sourced weather observations are not widely used in standard weather models.

Biological and Ecological Data

  1. Plant Phenology:
    • Data on plant life cycles and their impact on local humidity and temperature are generally not used in weather prediction models.
  2. Animal Movement Patterns:
    • Animal behavior in response to atmospheric conditions is not utilized in current weather models.

Advanced Technological Data

  1. LiDAR Data:
    • While LiDAR is used in some atmospheric research, it is not commonly integrated into standard weather prediction models.

Integration of Multidisciplinary Data

  1. Health and Epidemiological Data:
    • Data on health conditions correlated with atmospheric changes are generally not used in weather prediction models.
  2. Economic Activity Data:
    • Information on industrial activity and energy consumption is typically not integrated into weather prediction models.


To improve weather prediction, especially for phenomena like rain showers, integrating these underutilized data sources could provide a more comprehensive and accurate understanding of atmospheric processes. Leveraging innovative data sources such as aerosol concentrations, detailed cloud microphysics, and real-time social media observations could enhance the predictive power of weather models.
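As a rough illustration of what "integrating" a few of these sources might look like in practice, here is a minimal sketch that puts two conventional inputs next to three of the underutilized ones named above and feeds them to a logistic score. Everything in it is an assumption made for illustration: the field names, the units, the scaling, and especially the weights, which are placeholders and not fitted to any real observations.

```python
import math
from dataclasses import dataclass


@dataclass
class ShowerFeatures:
    # Conventional inputs
    relative_humidity: float      # fraction, 0..1
    pressure_hpa: float           # surface pressure in hPa
    # Underutilized inputs from the list above (hypothetical names and units)
    soil_moisture: float          # volumetric fraction, 0..1
    aerosol_optical_depth: float  # unitless
    surface_roughness_m: float    # roughness length in meters


def to_vector(f: ShowerFeatures) -> list[float]:
    """Flatten the features, centering pressure on a 1013.25 hPa reference."""
    return [
        f.relative_humidity,
        (f.pressure_hpa - 1013.25) / 10.0,
        f.soil_moisture,
        f.aerosol_optical_depth,
        f.surface_roughness_m,
    ]


def shower_probability(f: ShowerFeatures, weights: list[float], bias: float) -> float:
    """Logistic score: sigmoid of a weighted sum of the feature vector."""
    z = bias + sum(w * x for w, x in zip(weights, to_vector(f)))
    return 1.0 / (1.0 + math.exp(-z))


# Made-up sample and placeholder weights, for illustration only.
sample = ShowerFeatures(0.85, 1005.0, 0.30, 0.25, 0.5)
weights = [2.0, -0.5, 1.0, 0.5, 0.1]
p = shower_probability(sample, weights, bias=-2.0)
print(f"Illustrative shower probability: {p:.2f}")
```

A real model would learn the weights from historical observations, and would almost certainly be something richer than a single logistic unit. The point of the sketch is only that the new sources enter as extra columns alongside the usual ones.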
