In a recent thread on X, Amira Valliani from the Solana Foundation sparked a conversation about why advanced robots from companies like Boston Dynamics and Waymo aren't as ubiquitous as we'd expect. Teaming up with Rishin Sharma, they dove into the hurdles preventing robotics from having its "ChatGPT moment." The crux? Data—or rather, the lack of it for the physical world.
If you've followed AI developments, you know large language models like ChatGPT were trained on massive digital datasets: text, videos, and audio scraped from decades of internet content. But robots need data from the real world—things like sensor readings, environmental interactions, and unpredictable scenarios. As Amira points out, we simply don't have the same volume or quality of physical data yet.
The Data Dilemma in Physical AI
Training AI for the physical realm is tricky. Internal data from robots, like joint positions and forces, combines with external inputs such as camera feeds and audio. But the real world is messy: edge cases like sudden weather changes, wildlife encounters, or urban obstacles aren't well-documented in open datasets. Current open robotics data hovers around 5TB, dwarfed by the 100TB+ used for digital AI.
Big players like Tesla are tackling this by paying workers $48/hour for tasks like folding laundry to generate training data for their Optimus robot. It's effective but centralized and costly. This is where Decentralized Physical Infrastructure Networks (DePIN) come in, offering a blockchain-powered alternative that's distributed, incentivized, and scalable.
How DePIN Mobilizes Global Data Collection
DePIN flips the script by using crypto incentives to crowdsource data from millions worldwide. Instead of relying on a single company's workforce, these networks reward contributors for uploading high-quality, unique data. Validators ensure accuracy, creating a self-sustaining ecosystem.
This approach is already making waves across sectors:
Autonomous Vehicles: Projects like Hivemapper, ROVR, and NATIX are mapping roads globally. NATIX, for instance, partnered with ride-hailing giant Grab to cover 170 million kilometers with 250,000 drivers.
Drones and Precision Sensing: GEODNET and Onocoy focus on aerial data, while Raad Labs handles specialized sensing.
Humanoid Robots and Automation: Emerging players like Bitrobot, PrismaX, and Reborn are building datasets for industrial and humanoid applications. Others, such as Auki Labs and OverTheReality, integrate AR for spatial mapping.
Even game engines are getting involved, with Shaga using simulated environments from games like Grand Theft Auto to train self-driving AI.
The Massive Opportunity Ahead
The potential market for physical AI is enormous. Projections estimate:
- Autonomous driving: $350 billion by 2035
- Drone networks: $83 billion by 2035
- Humanoid robots: $38 billion by 2035, potentially $5 trillion by 2050
- Overall physical AI: $100–$600 billion by 2035
DePIN could democratize this space, moving away from proprietary silos toward open, community-driven models. On Solana, with its high-speed blockchain, these networks can handle the transaction volume needed for incentives and validations efficiently.
Challenges and Realistic Expectations
Of course, it's not all smooth sailing. Market fragmentation means demand is concentrated among a few big buyers like Tesla or DeepMind, with small contract sizes. Simulations might prove cheaper and sufficient for many use cases, leveraging physics engines from gaming. Plus, crowdsourced data needs to balance generality with specificity—generic uploads might not cut it for niche applications.
Privacy concerns, regulatory hurdles, and ensuring data quality are ongoing issues. But as Amira and Rishin note in their full deep dive on Solana's blog, DePIN's cryptoeconomic model could be the key to unlocking robotics' full potential.
This thread highlights how blockchain isn't just about finance or memes—it's enabling real-world AI innovations. For blockchain practitioners, keeping an eye on DePIN projects could reveal the next wave of opportunities in meme tokens and beyond, especially those tied to AI and robotics themes. Stay tuned as this space evolves!