crj1998
  • Joined on Nov 24, 2022

Datasets

voice
speech synthesis speech processing 5

voice

Updated 1 month ago

Benchmark
image description generation computer vision and natural language processing 14

Diffusion benchmark

Updated 1 month ago

lora
text annotation computer vision 5

1

Updated 6 months ago

LayoutGenerate
image description generation computer vision and natural language processing 8

Layout Generate

Updated 8 months ago

Flickr-8k
image search computer vision and natural language processing 12

A new benchmark collection for sentence-based image description and search, consisting of 8,000 images that are each paired with five different captions which provide clear descriptions of the salient entities and events. … The images were chosen from six different Flickr groups, and tend not to contain any well-known people or locations, but were manually selected to depict a variety of scenes and situations

Updated 9 months ago

pokemon-blip-captions
image description generation computer vision and natural language processing 2

pokemon-blip-captions

Updated 11 months ago

CelebAMask-HQ
face recognition computer vision 2

CelebAMask-HQ is a large-scale face image dataset that has 30,000 high-resolution face images selected from the CelebA dataset by following CelebA-HQ. Each image has segmentation mask of facial attributes corresponding to CelebA. The masks of CelebAMask-HQ were manually-annotated with the size of 512 x 512 and 19 classes including all facial components and accessories such as skin, nose, eyes, eyebrows, ears, mouth, lip, hair, hat, eyeglass, earring, necklace, neck, and cloth.

Updated 1 year ago