SEATTLE, April 6, 2021 /PRNewswire/ -- To address the rapid increase in demand for high-quality, bias-aware AI training data, DefinedCrowd today announced the expansion of its online data marketplace, DefinedData, to third-party providers who want to sell or share AI datasets, as well as a collaboration with NVIDIA to provide dataset samples through the NVIDIA NGC catalog. In addition, the platform now provides AI engineers with unprecedented levels of training data transparency and a range of subscription options, with special discounts for academia.
- AI is Top of the Corporate Agenda
The global pandemic has pushed AI to the top of the corporate agenda. A study by IDC shows that, in 2020, the AI market was predicted to be worth $300 billion by 2024. As of February 2021, the market is expected to break the $500 billion mark in 2024. A McKinsey survey found that responses to the crisis sped up the adoption of digital technologies by several years, with 61% of high-performance companies increasing their investment in AI. This acceleration in AI development has created a huge increase in the demand for high-quality datasets.
- Avoiding AI Bias Through Data Transparency
As more AI systems are deployed, and at a faster rate, the implications of underlying bias come to the fore. To address this concern, DefinedData's catalog now offers detailed information on the gender, age, accent, and phonetic distribution of datasets, as well as metadata on the recordings and audio samples. Access the up-to-date catalog here.
- Democratizing Data Access through NVIDIA NGC
As a key step in democratizing access to data, DefinedCrowd will provide dataset samples through the NVIDIA NGC catalog, a GPU-optimized hub for AI and HPC containers, pre-trained models, and SDKs that simplifies and accelerates end-to-end workflows. Datasets can be used to train models using libraries across the NVIDIA Jarvis application framework; the NVIDIA Transfer Learning Toolkit, which enables developers to build production-quality models faster with no coding required; and the NVIDIA NeMo platform, a Python toolkit for building, training, and fine-tuning GPU-accelerated conversational AI models. This collaboration enables researchers and developers to build high-quality, state-of-the-art conversational AI models.
“By working with DefinedCrowd, we’re providing NVIDIA Jarvis and NeMo users with sample datasets to build and accelerate their models, all within the NGC environment,” said Richard Kerris, head of developer relations at NVIDIA.
- Affordable Dataset Subscriptions
DefinedCrowd is introducing DefinedData subscriptions, offering access to a constantly expanding and refined catalog of high-quality speech and NLP datasets.
“Companies constantly need to engage a long tail of data in order to grow in new sectors, and data scientists need the raw material in order to address these issues as data science becomes more democratic each day,” said Dr. Christopher Shulby, Director of Machine Learning at DefinedCrowd. “This offering will allow data scientists to keep their models relevant in a continually evolving world.”
To learn more about DefinedData's subscriptions, follow this link. Academia will have access to special pricing options.
DefinedCrowd is encouraging third parties to list and sell their datasets on DefinedData, in an effort to address the increasing demand for AI training data. To ensure world-class quality, all datasets will be subjected to a vetting process before being made available. Express your interest in becoming a vendor on DefinedData here.
“This is an exciting moment. I am proud to see DefinedCrowd becoming the GitHub of AI,” said Dr. Daniela Braga, Founder and CEO. “Transparent, traceable and bias-aware data is crucial to build ethical AI technologies.”
Catarina Peyroteo Salteiro
Director of Global Communication & Brand
SOURCE DefinedCrowd Corp.