![IBM Watson Projects](https://wfqqreader-1252317822.image.myqcloud.com/cover/334/36699334/b_36699334.jpg)
Refine
One of the more important IBM Watson tasks is Refine (which we mentioned earlier in this chapter). Here we will walk through the basics, using some Watson sample data:
- From the Quick-start bar, click on Refine. In the Refine data set dialog, scroll down and select Sample data; the Sample data dialog then opens, as shown here:
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/7895c4e8-2689-4500-9024-0ad45d7a6cba.png?sign=1739303804-4BDKKZZXvBDnjIUZqUUb6dCuszW3ctaL-0-cf1bc186ded0a0d0f273735511d71902)
- IBM Watson provides a good list of sample data sets, each worth spending some time exploring and experimenting with. For now, let's pick the Bike Sharing data set and then click on Upload:
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/fd55e780-04aa-498b-bd90-1f88f15c70ec.png?sign=1739303804-6cNtWXTxj1uxNdHSzfenM4Kty4x1Cohl-0-529d8fb96a6280592bcbdcf631884f63)
To access the sample data, you first need to upload it. Once uploaded, it appears as an informational tile (shown as follows) that can be selected and used:
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/4088c37d-e3b9-4408-8695-4db61158c546.png?sign=1739303804-2mwbxeUOIFvzefnESKiohmB2cLHiOiJk-0-ac7b8b267c382363642bf9a3062f17d1)
- Now that the data is loaded and available, you can select it from the Refine data set list, which automatically loads it into the Refine page (which looks a lot like an Excel worksheet):
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/96bccdd2-acf4-41bb-bccf-244165702c5a.png?sign=1739303804-yOJ48UAzPGyyDz61wXN5cfjYfMzXTPaI-0-c072076d2065f1d0473c939e67b7a42e)
There are many tasks you can perform using Refine, such as:
- General housekeeping: Renaming columns, changing data types, or creating a subset of the data by filtering out irrelevant records
- Summarization: Altering the default aggregations
- Enrichment: Adding calculated fields, hierarchies, and groups
- Data quality review: Checking the metrics of the data, such as a quality score by data field or column
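Refine itself is a point-and-click tool, so no code is needed to perform these tasks. Still, it can help to see what each kind of refinement amounts to. The following is a rough sketch in plain Python, not Watson's implementation: the rows, the column names (`dteday`, `hr`, `temp`, `casual`, `registered`), and the helper functions are all hypothetical, and the "quality score" here is simply the percentage of non-missing values per column, which is an assumption rather than the metric Watson computes.

```python
from collections import defaultdict
from datetime import date

# Hypothetical rows standing in for a bike sharing data set.
# Column names and values are invented for illustration only.
RAW_ROWS = [
    {"dteday": "2011-01-01", "hr": "0", "temp": "0.24", "casual": "3", "registered": "13"},
    {"dteday": "2011-01-01", "hr": "1", "temp": "0.22", "casual": "8", "registered": "32"},
    {"dteday": "2011-01-02", "hr": "0", "temp": "", "casual": "4", "registered": "13"},
    {"dteday": "2011-01-02", "hr": "1", "temp": "0.46", "casual": "1", "registered": "16"},
    {"dteday": "2011-01-02", "hr": "24", "temp": "0.40", "casual": "2", "registered": "5"},
]


def refine(rows):
    """Housekeeping and enrichment: rename, retype, filter, add a field."""
    refined = []
    for row in rows:
        hour = int(row["hr"])  # change data type: text -> integer
        if not 0 <= hour <= 23:
            continue  # filter out records with an invalid hour
        casual = int(row["casual"])
        registered = int(row["registered"])
        refined.append({
            "date": date.fromisoformat(row["dteday"]),  # renamed and retyped
            "hour": hour,                               # renamed from "hr"
            "temp": float(row["temp"]) if row["temp"] else None,
            "casual": casual,
            "registered": registered,
            "total": casual + registered,               # calculated field
        })
    return refined


def daily_totals(rows):
    """Summarization: aggregate the calculated total by date (sum)."""
    totals = defaultdict(int)
    for row in rows:
        totals[row["date"]] += row["total"]
    return dict(totals)


def completeness_by_column(rows):
    """Assumed quality metric: percent of non-missing values per column."""
    return {
        column: 100 * sum(row[column] is not None for row in rows) / len(rows)
        for column in rows[0]
    }


if __name__ == "__main__":
    refined_rows = refine(RAW_ROWS)
    print(daily_totals(refined_rows))
    print(completeness_by_column(refined_rows))
```

Each function maps to one group of tasks in the list above: `refine` covers housekeeping and enrichment, `daily_totals` covers summarization, and `completeness_by_column` stands in for a per-column quality review.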
For now, let's assume we've made some of the previously mentioned refinements to our data and want to save it as a new file. To do that, click on the SAVE icon (it looks like a tiny diskette in the upper left of the page), enter an appropriate name for the new file, and click Save (on the Save as popup shown as follows):
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/4bca9538-35e3-410a-aa32-ef69105a92a6.png?sign=1739303804-eFKjFSrK0LLOk7PMvC473F1lwkTulj9I-0-126b86fe268acebc469eb9bdf333b0c5)
If you are working in a multi-user environment, your new (refined) dataset is saved by default in your personal folder. To share your refined dataset with others, move it to a shared folder.