
The A.I. revolution could be under threat as Big Tech attempts to take web data out of the public domain

By Or Lenchner
May 17, 2023, 10:36 AM ET
Facebook parent Meta has released open A.I. models. (Josh Edelson, AFP/Getty Images)

Technologies that fundamentally change society only come around once every decade or so. The Internet was one. Artificial intelligence (A.I.) is the next. A.I. has the potential to improve lives and reshape industries from healthcare to finance, but A.I. can only be as good as the data it’s trained on.

The extensive growth of text, images, videos, and audio available on the public web has fueled the rise of A.I. models by providing a constantly expanding source of information. This is why researchers predict that A.I., already a $137 billion industry, will grow more than 37% each year this decade.

For instance, Meta recently released LLaMA, “a collection of foundation language models” that aims to democratize access to A.I. research. “We train our models on trillions of tokens, and show that it is possible to train state-of-the-art models using publicly available datasets exclusively,” the Facebook parent said.

However, even as it touts the importance of publicly available data to A.I., Meta is simultaneously pursuing litigation to close access to public web data that it acknowledges it does not own.

If Big Tech is allowed to build a walled garden around data that’s present in the public domain (meaning data that isn’t behind a login), it will prevent A.I. from reaching its full potential.

Looking ahead, the volume of data and information created, captured, copied, and consumed worldwide is expected to reach 120 zettabytes this year, nearly triple what it was in 2019.

If publicly available web data is stripped from the public and held only by the most powerful companies, the ability of A.I. to advance in a way that benefits society will be severely limited. If only a few companies are developing cutting-edge A.I., its development will not be aligned with humanity’s best interests.

Publicly available data is not only the lifeblood of emerging artificial intelligence tools, but it’s also essential for current business operations. Companies and nonprofits alike rely on publicly available web data to efficiently and effectively carry out their missions, with 94% using it on a daily basis, according to a survey of 150 IT, technology, and data analytics experts from U.S. retail, technology, and nonprofit organizations. In this survey, nearly four out of five respondents stated they would be unable to operate effectively without access to public web data.

The potential for A.I. to be used for social good is equally exciting. For example, through our pro bono program, The Bright Initiative, we assist nonprofit, academic, and charitable organizations, helping them tackle serious social problems such as antisemitism, hate speech, and human trafficking.

More broadly, developers must have access to the datasets they need to ethically train A.I. Because it provides a vast amount of diverse and up-to-date information, public web data can be used to train machine learning models, improve their accuracy, and ensure A.I. is aligned with humanity’s goals.

Or Lenchner is the CEO of Bright Data, a web data platform dedicated to maintaining transparent access to public web data for all.

The opinions expressed in Coins2Day.com commentary pieces are solely the views of their authors and do not necessarily reflect the opinions and beliefs of Coins2Day.
