GENERATED USA Passports Dataset

Data generation in machine learning involves creating or manipulating data to train and evaluate machine learning models. The purpose of data generation is to provide diverse and representative examples that cover a wide range of scenarios, ensuring the model's robustness and generalization.

Data augmentation techniques apply various transformations to existing data samples to create new ones. These transformations include random rotations, translations, scaling, flips, and more. Augmentation increases the dataset size, introduces natural variations, and improves model performance by making the model more invariant to specific transformations.
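As a rough illustration of the geometric transformations mentioned above, here is a minimal sketch that works on a grayscale image stored as a plain list of pixel rows, so it runs without any imaging library. The function names (`flip_horizontal`, `translate`, `rotate90`, `random_augment`) are hypothetical and are not taken from the dataset's own tooling; a real pipeline would more likely use an imaging library such as Pillow.

```python
import random

def flip_horizontal(img):
    """Mirror each row of a 2D grayscale image (list of pixel rows)."""
    return [row[::-1] for row in img]

def translate(img, dx, dy, fill=0):
    """Shift the image by (dx, dy) pixels; exposed areas are padded with `fill`."""
    h, w = len(img), len(img[0])
    out = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w:
                out[ny][nx] = img[y][x]
    return out

def rotate90(img):
    """Rotate the image 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]

def random_augment(img, rng):
    """Apply a random horizontal flip followed by a shift of at most one pixel."""
    if rng.random() < 0.5:
        img = flip_horizontal(img)
    return translate(img, rng.randint(-1, 1), rng.randint(-1, 1))

# Tiny 2x3 "image" used only for demonstration.
image = [[1, 2, 3],
         [4, 5, 6]]
augmented = random_augment(image, random.Random(0))
```

Passing a seeded `random.Random` instance keeps the augmentation reproducible, which helps when comparing model runs.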

The dataset contains GENERATED USA passports, which are replicas of official passports but with randomly generated details, such as name, date of birth, etc. The primary intention of generating these fake passports is to demonstrate the structure and content of a typical passport document and to train neural networks to identify this type of document.

Generated passports can assist in conducting research without accessing or compromising real user data that is often sensitive and subject to privacy regulations. Synthetic data generation allows researchers to develop and refine models using simulated passport data without risking privacy leaks.

The dataset is solely for informational or educational purposes and should not be used for any fraudulent or deceptive activities.

Get the dataset

This is only a sample of the data.

Leave a request on https://trainingdata.pro/data-market to discuss your requirements, learn about the price and buy the dataset.

Content

Folders

  • original: includes original generated images of USA passports
  • augmentation: contains one subfolder per original photo, each holding 3 black-and-white generated passport scans with different photo editing applied.

The augmented photos have random rotations, noise, and brightness changes applied. The strength of the augmentation varies with the amount of noise and blur in the passport images, from slight (us_pass_augmentated_1) to significant (us_pass_augmentated_3).
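The graded severity described above could be sketched as follows. The actual parameters used to produce the dataset are not published, so the values in `LEVELS` are purely hypothetical, as are the names `LEVELS`, `clamp`, and `augment`. The sketch covers only brightness and noise on a grayscale image stored as a list of pixel rows; rotation and blur are left out for brevity.

```python
import random

# Hypothetical severity settings; the dataset's real parameters are unknown.
LEVELS = {
    1: {"noise": 5, "brightness": 1.05},   # slight
    2: {"noise": 15, "brightness": 1.15},  # moderate
    3: {"noise": 30, "brightness": 1.30},  # significant
}

def clamp(value):
    """Round to an integer and keep it inside the 8-bit range 0..255."""
    return max(0, min(255, int(round(value))))

def augment(img, level, seed=0):
    """Scale brightness, then add uniform noise in [-noise, +noise] to each pixel."""
    cfg = LEVELS[level]
    rng = random.Random(seed)
    return [
        [clamp(p * cfg["brightness"] + rng.uniform(-cfg["noise"], cfg["noise"]))
         for p in row]
        for row in img
    ]

# Flat gray 4x4 test image; each level perturbs it more strongly.
gray = [[100] * 4 for _ in range(4)]
variants = {level: augment(gray, level) for level in LEVELS}
```

Keeping the settings in a single table makes it easy to add a fourth level or to tune the existing ones without touching the transformation code.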

File with the extension .csv

The .csv file includes the following information for each media file:

  • original: link to access the image of the generated passport,
  • us_pass_augmentated_1: link to the first augmented image,
  • us_pass_augmentated_2: link to the second augmented image,
  • us_pass_augmentated_3: link to the third augmented image
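A minimal sketch of reading such a file with Python's standard `csv` module is shown below. The column names come from the list above; the sample rows, the file paths in them, and the helper name `load_records` are hypothetical, since the real file name and link format are not given here.

```python
import csv
import io

AUG_COLUMNS = [
    "us_pass_augmentated_1",
    "us_pass_augmentated_2",
    "us_pass_augmentated_3",
]

# Hypothetical sample content; the real links in the dataset may look different.
SAMPLE_CSV = """original,us_pass_augmentated_1,us_pass_augmentated_2,us_pass_augmentated_3
original/0.png,augmentation/0/1.png,augmentation/0/2.png,augmentation/0/3.png
original/1.png,augmentation/1/1.png,augmentation/1/2.png,augmentation/1/3.png
"""

def load_records(handle):
    """Group each original image link with its three augmented links."""
    records = []
    for row in csv.DictReader(handle):
        records.append({
            "original": row["original"],
            "augmented": [row[col] for col in AUG_COLUMNS],
        })
    return records

records = load_records(io.StringIO(SAMPLE_CSV))
for record in records:
    print(record["original"], "->", len(record["augmented"]), "augmented images")
```

To read the real file, pass an open file handle (for example `open(path, newline="")`) instead of the in-memory sample.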

USA passport photos can be generated in accordance with your requirements.

**TrainingData**

More datasets in TrainingData's Kaggle account: https://www.kaggle.com/trainingdatapro/datasets

TrainingData's GitHub: https://github.com/Trainingdata-datamarket/TrainingData_All_datasets
