The Canopy Data Import Tool allows you to import and manipulate text data files in an easy, reproducible way. It is built on top of Pandas, providing an exploratory, graphical interface to data manipulation while still using a familiar Python representation of your data. When you are happy with how your data is being imported, you can take control of the underlying DataFrame from the IPython console in the Canopy Editor. You can also export your command history as a Python script so that you can perform those same manipulations again.
The Data Import Tool is available to Enterprise-level Canopy subscribers. It is no longer available to individual subscribers.
To get started, and for a walkthrough of the provided examples, we encourage you to browse the Data Import Tool online documentation.
FAQ:
-
The Data Import Tool can read most common structured text files (csv, tsv, etc ...), however certain file formats are not accepted (
.py
.xml
.pdf
.json
.h5
.gif
.jpg
.png)
-
To parse data files with fixed column widths, set the DIT to both Fixed Width format and whitespace column separator.
- To use the Data Import Tool with Python 3.5 or 3.6 (in Canopy 2.1.3+), please install the latest version of the catalyst and pyqt packages in the Canopy Package Manager. Note that pyqt is licensed GPL; therefore to install it, the "enthought/gpl" repository must be enabled in Package Manager Settings.
For a list of common issues, please see Known Issues with Data Import Tool.
How do I purchase a subscription so that I can us the Data Import Tool?
I'm sorry, Zachary. All of Canopy's features except the DIT are now available to free users, and we are no longer selling individual Canopy subscriptions. Therefore, the DIT is no longer available to individual subscribers except for existing not-yet-expired subscribers. I have updated this article to reflect that fact. My apologies for the confusion and disappointment.