MAX, or Model Assets Exchange, is an online open-source repository for trainable/deployable AI models.

CODAIT also launched the Data Assets Exchange (DAX).

Where MAX hosts full AI models, DAX contains datasets that can be used to train your own.

Here’s what you need to know about IBM’s new open-source Data Asset Exchange for AI

What does IBM mean when it says the datasets will be carefully curated?

Are they checked for bias or accuracy?

Someone creates a list or a database, and random people from the Internet submit links to data.

It’s up to you, the dataset consumer, to figure out whether a given dataset is useful.

Who owns the data?

Did the person who posted it have a right to post it?

Do I have the right to download it?

Can I safely use the data in a business program?

When possible, we reach out to the original creator of the data.

We collect detailed metadata about where the data comes from.

We familiarize ourselves with the research papers behind the datasets.

We even look at the actual data items themselves to check for potential legal and data quality issues.

Every dataset goes through IBM’s own internal legal review process.

Only then does a dataset go live on the site.

And we don’t stop with just posting this vetted data.

You should start seeing the results of these efforts soon.

And we’re writing ready-made training scripts for training deep learning models on the data.


For every dataset currently on the site, there are roughly three more planned.

For the near term, we are continuing our focus on IBM Research data.

Some datasets are currently waiting for peer-reviewed articles to be published before we can post them.

The current offerings on DAX are pretty eclectic; the double pendulum videos dataset in particular stands out.

What do you see developers using that for?

It should be out soon.

You could also use the video as a sanity check for deep pose estimation algorithms.

Will developers be able to upload datasets to DAX?

We certainly plan to add that capability in the future.

The key challenge there is to maintain the current level of curation and to make the entire process open.

Our current focus is on enabling consumption by developers worldwide.

Having this collection of vetted datasets opens up some exciting possibilities for other related parts of developer.ibm.com.

For more information on IBM’s DAX, read the company’s blog post here and check out the datasets here.

You can also view the models available on MAX here.
