Background images for training #4

twehrbein · 2023-08-17T08:54:52Z

Hi! Thanks for releasing the code!
I'm trying to reproduce the training and thus need to gather all the training and validation backgrounds. Following your description, I used the lsun repo to download and extract the backgrounds. However, now I'm struggling with 1) selecting the "correct" background images and 2) converting them to the right format and to the right location.
The provided script data/copy_lsun_images_to_train_files_dir.py doesn't work for me, since I guess the directory structure isn't correct after extracting the images. E.g. "bedroom_train_lmdb" extracts images to e.g. ./f/8/8/1/2/2/*.webp which isn't compatible with your script. Your script also only looks for .jpg files. Furthermore, I don't know how to select the mentioned 397582 training backgrounds, since e.g. "bedroom_train_lmdb" alone has over 3mio images. Would be grateful for any help!

The text was updated successfully, but these errors were encountered:

akashsengupta1997 · 2023-08-22T19:02:17Z

Hey!

That's odd, IIRC the script used to work with the dataset as extracted. I will take a look this weekend and get back to you.

twehrbein · 2023-09-11T09:02:17Z

Hey, any update?

Fly-Pluche · 2023-10-23T00:52:27Z

Hello, may I ask if there is any progress？

noahcao · 2024-04-10T22:41:16Z

Hey is there any update?

One way may work, you should change the function for exporting images as [see issue]

def export_images(db_path, out_dir, flat=True, limit=-1):
    print('Exporting', db_path, 'to', out_dir)
    env = lmdb.open(db_path, map_size=1099511627776,
                    max_readers=100, readonly=True)
    count = 0
    with env.begin(write=False) as txn:
        cursor = txn.cursor()
        for key, val in cursor:
            if not flat:
                image_out_dir = join(out_dir, '/'.join(key[:6].decode()))
            else:
                image_out_dir = out_dir
            if not exists(image_out_dir):
                os.makedirs(image_out_dir)
            print('Current key:', key)
            image_out_path = join(image_out_dir, key.decode() + '.jpg')
            img = cv2.imdecode(
                numpy.fromstring(val, dtype=numpy.uint8), 1)
            cv2.imwrite(image_out_path, img)
            count += 1
            if count == limit:
                break
            if count % 1000 == 0:
                print('Finished', count, 'images')

then, you should extract the images with a --flat flag:

python3 data.py export *_val_lmdb --out_dir val
python3 data.py export *_train_lmdb --out_dir train

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Background images for training #4

Background images for training #4

twehrbein commented Aug 17, 2023

akashsengupta1997 commented Aug 22, 2023

Uh oh!

twehrbein commented Sep 11, 2023

Uh oh!

Fly-Pluche commented Oct 23, 2023

Uh oh!

noahcao commented Apr 10, 2024 •

edited

Loading

Uh oh!

Background images for training #4

Background images for training #4

Comments

twehrbein commented Aug 17, 2023

akashsengupta1997 commented Aug 22, 2023

Uh oh!

twehrbein commented Sep 11, 2023

Uh oh!

Fly-Pluche commented Oct 23, 2023

Uh oh!

noahcao commented Apr 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

noahcao commented Apr 10, 2024 •

edited

Loading