hackquest logo

ClearSky

ClearSky makes the world’s environmental data trustworthy, tradable, and ready for AI.

ビデオ

説明

ClearSky - Turning Environmental Telemetry Into Verifiable, Monetizable Story-IP


Visit Us

Environmental data is everywhere, in cities, on rooftops, inside vehicles, outside schools, on farmland, inside factories.
But even though air-quality readings and pollution telemetry influence public policyscientific researchhealth advisories, and AI climate models, the underlying datasets have a hidden problem that most people never see:

They are fundamentally untrustworthy.

Today’s environmental datasets have no verifiable source of origin, no cryptographic integrity, no transparent licensing, and no fair economic structure for the people or devices producing the data. They are treated as public utilities but operate like unregulated content streams scraped, aggregated, repackaged, and redistributed without proof, ownership, or accountability.

ClearSky was built to fix that.

ClearSky transforms real-world environmental telemetry into verifiable, licensable, and composable IP assets on Story Protocol. It creates a regenerative data economy where sensor operators, dataset buyers, and AI model builders participate in a transparent value chain, each earning royalties through Story’s programmable IP graph.

It is not just a dataset marketplace.
It is not just a DePIN simulation.
It is the first end-to-end pipeline where data → dataset → model → derivative forms a continuous, auditable chain of IP.


The Problem: The Hidden Fragility of Environmental Data

AI cannot trust its training data

Environmental AI models forecasting pollution, planning transportation, predicting asthma spikes depend on clean, high-quality datasets. But current sources provide no cryptographic trail. AI researchers struggle with:

  • unverifiable raw data

  • no information about device authenticity

  • unclear licensing or commercial usage rights

  • inconsistent formats and no lineage across datasets

If the data feeding the model is unverified, the model is untrustworthy by design.

Enterprises and climate agencies need evidence-grade telemetry

City governments, environmental boards, climate researchers, and mitigation planners base their decisions on data that appears authoritative but offers no proof that:

  • the readings come from real devices

  • the timestamps have not been manipulated

  • no intermediary tampered with the payloads

  • geographical origin is accurate

  • the dataset is complete

Scientific-grade environmental data must be legally defensible and cryptographically verifiable, not just plausible.

Sensor operators receive no ownership or reward

People or communities who deploy sensors today whether hobbyists, NGOs, universities, or local groups   generate real economic value. Yet they receive:

  • no recognition

  • no licensing control

  • no royalties if someone buys the dataset

  • no incentive to maintain or expand sensor coverage

In today’s system, the data creators earn nothing, even though they are the foundation of the entire stack.

Environmental telemetry needs provenance.
It needs licensing.
It needs fair economics.
It needs composability.

ClearSky delivers all four.


A New Data Economy Built on Story Protocol

ClearSky turns raw environmental readings like PM2.5, CO₂, temperature, humidity into programmable intellectual property.
Here’s the core idea:

  1. Sensors sign their telemetry, proving data origin.

  2. ClearSky organizes this into Merkle-backed, verifiable window sets.

  3. When a buyer requests a dataset, ClearSky assembles it, pins it to IPFS, and mints it as a Story IP asset.

  4. Story Protocol attaches licensing and royalty rules to that dataset.

  5. Buyers can build AI models or analytics on top of the dataset and mint those models as derivative IP.

  6. Royalties automatically flow backward   from models → datasets → sensors   creating a regenerative IP graph.

The result:
Every actor in the chain from sensor operator to AI scientist becomes an IP creator with enforceable rights.

ClearSky showcases Story Protocol at full conceptual potential:

Programmable IP Assets for datasets

When a buyer selects a dataset (e.g., “Delhi, 4–6 PM”), ClearSky builds a Merkle-verified slice and mints it as a new Story IP asset. This provides:

  • immutable provenance

  • transparent licensing

  • programmable royalties

  • long-term IP rights for the buyer

This is Story Protocol’s vision applied to real-world data.

Derivative IP for AI models

A dataset is not the end   it is the beginning.
ClearSky enables builders to:

  • train AI models

  • generate insight reports

  • build risk predictors

  • create pollutant classifiers

and mint these creations as child IP assets linked to the parent dataset.

This brings Story’s “derivative creativity graph” into the real world.

Royalty routing across the IP graph

Whenever a dataset is purchased or a model derived from it is sold:

  • the dataset creator (buyer) earns

  • the sensor operators whose data contributed earn

  • further derivatives earn from their descendants

This demonstrates Story Protocol’s programmable royalties operating across a multi-level pipeline.

AI-ready datasets + verifiable origin

The platform produces datasets that can be openly used for:

  • machine learning

  • climate risk modelling

  • urban planning

  • academic research

with the full lineage preserved.

DePIN-aligned incentive model

While ClearSky uses simulated devices for the hackathon, the architecture mirrors real DePIN networks:

  • human-linked device registration

  • signed payloads

  • incentive distribution

  • composable data layers

It is a realistic blueprint of a decentralized environmental data network.


How ClearSky Works

ClearSky is designed to feel intuitive from the outside, even though the internal economics and cryptography are sophisticated.

Below is the end-to-end product flow:


1. A sensor operator joins and registers devices

Operators authenticate and link up to three sensors to their identity.
Each sensor now functions like a cryptographic data source with:

  • a unique keypair

  • metadata (location/type)

  • attribution rights

This establishes the foundation for provenance.


2. Sensors emit signed telemetry

Each reading is:

  • canonicalized

  • hashed

  • signed with the device key

  • bundled into internal time windows

This gives every datapoint a proof of origin, which survives all downstream transformation.


3. ClearSky organizes data into verifiable windows

Instead of making every window an IP asset, ClearSky uses them as building blocks.

Think of them as “digital ore”   raw materials awaiting refinement.
Each window has:

  • compressed data

  • Merkle root

  • proofs for each reading

  • IPFS pin

  • reference metadata

These windows allow fast assembly of future custom datasets.


4. A buyer purchases a dataset

A buyer enters the marketplace and selects:

  • time range

  • geography

  • pollutant types

  • dataset resolution

ClearSky fetches the relevant data windows, reconstructs the dataset slice, and prepares it for minting.


5. ClearSky mints the dataset as a Story IP asset

For every purchased dataset:

  1. The assembled dataset is pinned to IPFS.

  2. A manifest describing the dataset, Merkle root, and metadata is created.

  3. The dataset is minted as a Story IP asset.

  4. Appropriate license terms (commercial, private, etc.) are attached.

  5. Royalty parameters are set.

The buyer now owns a licensed, on-chain dataset IP asset.

This dataset is legally and cryptographically theirs to use.


6. Sensor operators earn royalties

ClearSky computes contribution weights (e.g., number of readings in the dataset).
When the dataset is purchased:

  • Story Protocol routes royalties to relevant operators

  • A public on-chain record logs these allocations

  • Operators accumulate earnings as creators of the dataset’s underlying telemetry

This is the first time environmental data producers are economically recognized.


7. Buyers become builders derivative AI & analytics

The buyer can now:

  • train a machine learning model

  • generate analytical insights

  • produce risk indices

  • build pollution classifiers

  • create visualizations

And mint their creations as derivative IP assets linked to the original dataset.

Because the dataset is IP, the model inherits and extends that IP lineage.

This creates a chain of value:

sensors → dataset → model → derivative → downstream products


8. A marketplace of datasets and models emerges

ClearSky becomes a two-sided ecosystem:

For data buyers

High-integrity, evidence-grade datasets with licenses and provenance.

For model builders

A composable, monetizable catalog of datasets to build on.

For researchers and enterprises

Transparent IP lineage showing where models came from.

For sensor operators

An income stream tied to real economic usage of their data.


Impact: Why ClearSky Matters

ClearSky turns environmental telemetry once treated as passive, disposable data into active intellectual propertythat powers a new class of AI and scientific innovation.

1. For Climate Science

Data used for research and policy becomes provable, tamper-resistant, and royalty-aligned.

2. For Governments & Enterprises

Evidence-backed datasets support regulatory calculation, air-quality planning, and compliance audits.

3. For AI Teams

Training data comes with cryptographic provenance and enforceable licensing.

4. For Data Creators

Communities, NGOs, hobbyists, and device operators finally get recognition and monetization.

5. For Story Protocol

ClearSky is a flagship use case demonstrating how real-world signals can be transformed into a living IP economy where creativity compounds across time.


In One Sentence

ClearSky converts environmental signals into verifiable, licensable Story-IP datasets and enables a regenerative economy where sensor operators, data buyers, and AI model builders all earn together across an evolving IP graph.

ハッカソンの進行状況

During the Surreal Build Hackathon, our team focused on proving the complete lifecycle of ClearSky’s data-to-IP pipeline on Story Protocol. We implemented the essential backbone required to turn environmental telemetry into verifiable dataset IP, attach licensing, distribute royalties, and enable derivative AI models. First, we built a device sensor simulator, each device signing its telemetry using ECDSA. A backend ingestion service validates signatures, canonicalizes payloads, and stores readings. We then implemented a batching pipeline that recomputes hashes, constructs Merkle proofs, and pins each window’s manifest and data shard to IPFS establishing verifiable provenance for every reading. On top of these windows, we created an on-demand dataset builder that assembles any buyer-selected time range, computes a unified Merkle root, and pins the resulting dataset and manifest to IPFS. We integrated Story Protocol to mint these buyer-requested datasets as IP assets, attach commercial license terms, and define programmable royalty parameters. A contribution-based weighting system calculates royalties for sensor operators proportional to their data included in purchased datasets. We also implemented the ability for dataset buyers to mint derivative IP, enabling them to upload models or analytic artifacts, pin them to IPFS, and register them as child IP assets linked to the dataset. Finally, we built a license-gated download system using signed wallet messages and on-chain license validation, ensuring only authorized buyers can access dataset files. In summary, during the hackathon we delivered a functioning pipeline: verifiable telemetry → dataset assembly → IP minting → licensing → royalty allocation → derivative IP → secure access. This validates ClearSky as a practical, composable data IP ecosystem powered by Story Protocol.

テックスタック

Web3
DePIN
Solidity
MERN

資金調達の状況

No funds raised yet.

チームリーダー
AAbhinav Pangaria
オープンソース
業界
InfraSocialFiAI