Easy Data Processor: Merge, Clean, and Transform Your Data

This powerful and optimized actor is designed for efficient merging, cleaning, and transformation of large datasets.

Why Use This Actor

Speed: Experience blazing-fast data processing with parallelized workloads—up to 20x faster than standard methods.
Efficiency: Simultaneously read from multiple datasets, making it ideal for merging data after scraping.
Reliability: Actor migration proof with persisted steps, ensuring no repeated work or duplicated data.
Memory Management: 'Dedup as loading' mode allows for efficient memory usage, even with huge datasets (10M+ items).
Flexibility: Remove duplicates using multiple fields and nested objects/arrays with deep equality checks.
Storage Options: Store results in key-value store records.
Fast Blank Runs: Quickly identify duplicates without processing data.

Merging

Combine items from multiple datasets into a single dataset or key-value store output. In 'Dedup after load' mode, the order of items retains the order of the provided datasets.

Cleaning (Deduplication)

Specify fields for deduplication to remove duplicate items based on field values. Combine multiple fields for more precise deduplication. Deep comparison is used for objects and arrays.

Transformation

Perform custom data transformations before and after deduplication with preDedupTransformFunction and postDedupTransformFunction. These functions take an array of items and return a modified array.

Access helper variables and the Apify SDK reference within transformation functions. Customize transformations to suit your needs—whether filtering, adding, or modifying items.

Start optimizing your data processing workflow today with the Easy Data Processor and handle large datasets with ease!

Connect With Us

Blog: Read our latest articles
YouTube: Visit our channel
Instagram: Follow us on Instagram
AI Newsletter: Subscribe to our newsletter
Free Consultation: Book a free consultation call
More Tools: Explore our Apify actors

Support

Discord: Raise a support ticket here
Email: Contact us

Start enhancing your data processing today with QuickLifeSolutions' Easy Data Processor!

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
.actor		.actor
src		src
.DS_Store		.DS_Store
.editorconfig		.editorconfig
.eslintrc		.eslintrc
.gitignore		.gitignore
.npmignore		.npmignore
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
INPUT_SCHEMA.json		INPUT_SCHEMA.json
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Easy Data Processor: Merge, Clean, and Transform Your Data

Why Use This Actor

Merging

Cleaning (Deduplication)

Transformation

Connect With Us

Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Easy Data Processor: Merge, Clean, and Transform Your Data

Why Use This Actor

Merging

Cleaning (Deduplication)

Transformation

Connect With Us

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages