Data import is an essential step in data cleaning and processing. On the Supametas.AI platform, whether it is text, images, audio, or video, any raw data that needs cleaning and processing can be imported into a dataset for management. This article will explain in detail how to import metadata and briefly introduce task management and progress monitoring steps.
1. Import Metadata
Once you have created a dataset, you can import various types of raw data into it for further cleaning and processing. The platform not only supports importing data through APIs but also allows direct imports from various file formats and data sources, making data processing more flexible and efficient.
2. Choose Import Method
On the import page, you can select the appropriate import method based on the data source, including but not limited to:
-
Import from API:
- Suitable for users with development capabilities who want to import data via API calls.
- Please refer to the related API documentation for detailed configuration.
-
Import from Webpage:
- Supports web data scraping starting with
https://
, suitable for collecting data directly from the internet.
- Supports web data scraping starting with
-
Import from Local Text:
- Supports file formats such as
.docx
,.pdf
,.txt
,.md
,.json
, making it easy to import various document data.
- Supports file formats such as
-
Import from Audio:
- Supports audio file formats like
.mp3
,.wav
, ideal for processing audio data.
- Supports audio file formats like
-
Import from Image:
- Supports image files in
.png
,.jpg
formats, helping manage image data.
- Supports image files in
-
Import from Video:
- Supports video files like
.mov
,.mp4
,.mpv
, enabling fast integration of video data.
- Supports video files like
Choosing the appropriate import method ensures that data enters the dataset in the correct format, improving the efficiency of subsequent cleaning and processing.
3. Task Filtering and Management
To better manage import tasks, Supametas.AI provides a flexible task filtering feature to help you quickly locate and manage tasks:
-
Task Status Filtering:
- Quickly search for target tasks based on task status (e.g., "Not Started", "Importing", "Completed", "Import Failed").
-
Search Tasks:
- Quickly locate specific tasks by entering the task name, which is helpful when managing a large number of tasks.
This efficient task management mechanism ensures that you stay informed about the status of each task, allowing you to make adjustments during the data import process.
4. Monitor Import Progress
The import page will display the progress of all tasks in real-time, helping you fully control the data import process. The main features include:
-
Start Button (▶️):
- Start an import task that has been configured but not yet started. Once started, the system will automatically fetch and import data according to preset rules.
-
Stop Button (⏸️):
- Interrupt a task that is currently running. This is useful when a configuration error is detected or a task needs to be paused temporarily. Once stopped, the task status will display as "Stopped."
-
Delete Button (❌️):
- Permanently delete a task and its related uploaded data. Please note that the delete operation is irreversible, so ensure the task is no longer needed before using this feature.
With these intuitive operation buttons, you can adjust and control your data import tasks at any time to ensure the entire data processing workflow runs smoothly and efficiently.
Importing metadata is a crucial step in the Supametas.AI cloud service. By flexibly choosing data import methods and utilizing efficient task management and real-time progress monitoring features, you can easily manage the entire data cleaning process.