Researchers at LAION took a chunk of the Common Crawl data and pulled out every image with an "alt" attribute, a line or so of text meant to describe the image on a web page.
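The extraction step described here can be illustrated with a minimal sketch. This is not LAION's actual pipeline; it only shows the general idea of pairing each image URL with its alt text, using Python's standard `html.parser` module. The names `AltTextCollector` and `extract_image_alt_pairs` are hypothetical, and the choice to skip images with empty alt text is an assumption made for illustration.

```python
from html.parser import HTMLParser


class AltTextCollector(HTMLParser):
    """Collect (src, alt) pairs from <img> tags that carry non-empty alt text."""

    def __init__(self):
        super().__init__()
        self.pairs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag != "img":
            return
        attributes = dict(attrs)
        src = attributes.get("src")
        # An attribute with no value is reported as None, so fall back to "".
        alt = (attributes.get("alt") or "").strip()
        # Assumption: images with missing or empty alt text are skipped.
        if src and alt:
            self.pairs.append((src, alt))


def extract_image_alt_pairs(html):
    """Return a list of (image URL, alt text) tuples found in an HTML string."""
    collector = AltTextCollector()
    collector.feed(html)
    collector.close()
    return collector.pairs
```

Calling `extract_image_alt_pairs` on a page's HTML returns only the images that come with a usable caption, which is the kind of image-text pair the passage describes.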