Community docs

Finding and Creating Resources

data.world is designed to easily share resources with your students. In many classroom situations, educators will either want to find data or upload their own. We'll walk through both scenarios shortly, but first, it's helpful to understand that data.world houses two primary types of resources: datasets and projects.

  1. Datasets are collections of data files, documentation, scripts, metadata, and any other supporting resources to help other people understand the data. Datasets can include one file or several related files. While our platform can host a wide variety of file types, spreadsheet files (e.g., .csv, .tab, .xlsx) are likely to be the most useful for you.

  2. Projects are spaces to help you collect multiple datasets, query data, create data visualizations, collaborate on analyses, and share findings.

In a classroom setting, it may be most useful to think of projects as a place to provide details of an assignment. Datasets would then be connected to individual projects as you need them. Refer to our datasets vs. projects documentation for more information on this topic.

Finding data

The data.world platform is host to hundreds of thousands of open datasets. The way to access them is to type a search term into the "Search data.world" bar at the top of your page.

Note

If you have any resources inside of your organization, typing a term in the search bar will display your classroom's resources by default. To search the entire data.world open platform, select "Include all community results" under the "Results" header.

community-results.png

Once you've made your search, you can explore the results or utilize some of our advanced search features by clicking "Advanced" to help you narrow them down. For educators, you may want to try selecting dataset from beneath Resource Type, limiting your search to only include dataset resources. You can then filter further by selecting specific Owners or Tags that match what you're looking for.

Evaluating data you found

Now that you've found data on the data.world platform, you'll want to make sure that the data you've found meets your needs. Here are a few things that you may want to check:

  1. Recency: when was the data uploaded? When was it last updated?

  2. Documentation: does the dataset have a full description? Is the Data Dictionary completed?

  3. License: does this dataset license allow you to use the data for your purposes?

  4. File size: if your students will not be working with the data on the data.world platform, you'll want to check the file size before asking them to download since some programs struggle to open large files. If you're interested in file size limits on data.world, refer to our file size documentation.

While these are not meant to solve every use case you may have when evaluating a dataset you found for use in your classroom, hopefully this has provided you a good place to start.dataset license

Saving a dataset

If you find a dataset that you like, you can bookmark it to save it for later. Bookmarks are attached to your individual profile, not your classroom organization, so your students won't be able to see the things you bookmark.

If you find a dataset that you want to incorporate into a project or assignment, you can link that dataset directly to a project. We'll get into more detail on how to do that in the section on Creating an Assignment.

You also may find following particular organizations helpful. That way, you'll be notified whenever that organization adds new data or updates any of their existing datasets. Here are a few examples of some organizations you may be interested in following:

Data Journalism Organizations

Collections of Data Curated by the data.world Team

To follow an organization, you can visit their profile then click the "Follow" button. When organizations you are following make any updates, you'll receive an email notification about those updates.

Uploading Data

If you have found open data elsewhere that meets your needs but isn't on the data.world platform, you can upload it to the platform. You can either upload the data resource directly to your personal account or to your classroom. The process is the same.

  1. Either way, you'll want to click on the +New button located on the header bar right beside your profile picture.

  2. Select "Create new dataset"

  3. Give your dataset a name. This should be "human readable", meaning that it doesn't have to match the file name or omit spaces. Name your dataset something descriptive so that other users know what it's about, particularly if you plan to share this openly on the platform. Keep in mind that datasets can be connected to several projects, so instead of naming a dataset "Assignment 1", it's better practice to name your dataset something based on the contents of the data.

  4. Decide whether you want your classroom organization to be the "owner" of this dataset, or your personal account. If you plan to use the dataset with multiple classes, it may be better to make your personal account the "owner."

  5. Decide who to share the resource with:

    1. If you don't want to share this resource with anyone, select "No One"

    2. If you want to only share this resource with the students in your data.world classroom, make sure the classroom is the owner of the resource and select "All of _____" (where the ____ is the name of your classroom)

    3. If you want to share your dataset with the larger data.world community, click "Make public to data.world community"

      new-dataset.png
  6. Then click "Create dataset". Refer to our Creating Datasets documentation for more information.

  7. Next, add a brief description of this dataset. What data is included? What is the goal? Why would someone want to explore your dataset? This description and the title of your dataset will be visible when people on the open platform find your dataset in their searches.

  8. Then, upload your data by clicking the "Add data" button. You will be given several ways to add a data resource, including uploading from your computer, syncing from a URL, or integrating with other tools like Google Drive. When you're done, click "Continue".

  9. Now, add your documentation. Use the "Summary" area to provide any additional information or materials about where the data came from, how it was collected, any caveats involved etc. The dataset Summary area can be edited using Markdown syntax or our Simple Text Editor. Refer to our Document your data documentation for more information on data dictionaries, tagging, setting license types and more.

  10. For each file that you've uploaded to a single dataset, you can select "Add description" to add a brief description to each file. You can also add labels to indicate whether the data you've uploaded is "raw" or "clean".

    dataset-description.png
  11. If your file is tabular (e.g., a spreadsheet), you can provide information about how to interpret each column of data in the data dictionary. You can access the data dictionary in the "Add a description" menu, under the "Column details" tab.

    data-dictionary.png
  12. If your dataset is open to the public, you'll want to make sure that you add both Tags and a License. You can do that by clicking "Edit" next to "About this dataset". Tags make your dataset easier to find when users are searching for it, and licenses let other users know what they are allowed to do with your dataset. Refer to the "Setting a license type" and "Tagging" sections of our Document your data documentation for more information.

    edit-dataset.png
Creating an Assignment

On the data.world platform, projects can be used as spaces to share or collaborate on a particular assignment. For more details on the differences between projects and datasets, see the documentation on resources.

To create a new assignment within your classroom:

  1. Click on the +New button located on the header bar right beside your profile picture.

  2. Select "Create a new Project"

  3. Give your project a name. This should be "human readable", meaning that it doesn't have to match the URL or omit spaces. Name your project something descriptive so that other users know what it's about, particularly if you plan to share this openly on the platform.

  4. Set the Owner of the project to your classroom organization.

  5. Decide who you would like to share this assignment with:

    1. If you don't want to share this resource with anyone, select "No One"

    2. If you only want to share this resource with the students in your data.world classroom, make sure the classroom is the owner of the resource and select "All of ____" (where the ___ is the name of your classroom)

    3. If you want to share your project with the larger data.world community, click "Make public to data.world community"

      new-project.png
  6. Click "Create Project"

  7. Add a brief description of your project. What is the overall goal? Maybe add the due date here for easy reference.

  8. Next, you can connect data to this project. You can also connect data later, if you don't know which resources you want to connect yet. There are a few ways you can do this:

    1. Connect to a dataset that's already on data.world. Click "add data" and then select "Link a data.world dataset". Find the dataset you want to link either by searching, looking at your resources, or looking through your personal bookmarks. Once you find the dataset you want to add, click "Connect".

    2. Upload your own dataset. See the Uploading Data documentation for more information.

  9. Tip

    When you connect to an existing dataset, rather than upload files directly into a project, the dataset can be reused for other projects. If the dataset and associated projects are public, you can also find queries that have been run and other projects that have used a dataset on that dataset's profile page.

    connected-projects.png
  10. When you're finished, click "Done"

  11. Click "Project summary" to add additional information about this project. For assignments, include any information your students would find helpful or relevant to completing the assignment.

If you have finished creating your assignment and want to connect more datasets, you can:

  • Click on "Home" in the project directory and then click "Upload files or connect to data source". Refer to Steps 8-10 in the above section to continue.

If you've found a dataset and want to add it to a project:

  1. In the top right of the dataset overview page, you can click the arrow next to "Explore this dataset".

  2. If the project for your assignment has already been created, click "Connect to existing projects" and then search for a project that you have access to. Select the assignment you are looking for and click "Save".

  3. If the project for your assignment hasn't been created yet, select "Create a new project". This will walk you through the process of creating a new project with the dataset you've selected already added. See the Creating an Assignment section for more information.

    explore-dataset.png