{"id":19824,"date":"2023-06-21T06:51:00","date_gmt":"2023-06-21T06:51:00","guid":{"rendered":"https:\/\/web3unplugged.io\/blog\/?p=19824"},"modified":"2023-06-22T06:55:28","modified_gmt":"2023-06-22T06:55:28","slug":"cloudinary-adds-generative-ai-to-its-programmable-media-image-and-video-apis","status":"publish","type":"post","link":"https:\/\/web3unplugged.io\/blog\/cloudinary-adds-generative-ai-to-its-programmable-media-image-and-video-apis\/","title":{"rendered":"Cloudinary adds Generative AI to its Programmable Media Image and Video APIs"},"content":{"rendered":"\n<p>Cloudinary, the image and video platform that powers many of the world\u2019s top brands, today announced the availability of several new generative AI, large language models (LLM) and GPT-based features within its&nbsp;Programmable Media image and video APIs&nbsp;including Generative Fill, Generative Remove, Generative Replace, AI-powered Image Captioning, and a ChatGPT-backed natural language interface. The new Cloudinary features, available now, allow users tocreate customized, personalized assets in seconds and help technical teams scale quickly by intelligently automating workflows and eliminating repetitive and time-consuming image manipulation tasks. Learn more about these new features in today\u2019s blog posts&nbsp;here&nbsp;and&nbsp;here.<\/p>\n\n\n\n<p>Advanced creative work is time-consuming, expensive and, for some brands, simply out of reach. More than 10,000 customers and 1.5 million users have long benefited from the power of AI via Cloudinary\u2019s award-winning image and video APIs. Today\u2019s new generative AI capabilities extend these benefits even further by making what was once impossible, possible and more accessible for users to create, edit and deliver dynamic visual experiences at unprecedented scale. For example, instead of re-shooting an entire campaign, developers and digital marketers can remove unwanted objects and create beautiful images at scale through Cloudinary\u2019s APIs. Likewise, AI-powered image captioning produces intelligent captions for images instantly to improve accessibility, asset searchability and SEO while boosting productivity and reducing production time.<\/p>\n\n\n\n<p>\u201cGenerative AI has completely transformed the way we work, and the most recent advancements are just scratching the surface of what\u2019s possible,\u201d said Nadav Soferman, co-founder and chief product officer, Cloudinary. \u201cOur founding mission was to revolutionize the way in which brands manage and deliver images and video at scale, and building solutions that harness the most advanced technologies has been central to delivering on that promise.\u201d<\/p>\n\n\n\n<p>Soferman continued, \u201cSince launching our flagship image management product, which utilized AI for face-detection-based cropping, we\u2019ve led advancements across media management, leveraging the power of AI, machine and deep learning, and raising the bar for what\u2019s possible in media creation and delivery. It\u2019s always been about letting advanced technology streamline or eliminate tedious tasks so brands can focus their limited resources on creating high impact, highly visual sites, apps and campaigns that connect, engage, inspire and convert. We are very excited to make these powerful new generative AI capabilities a reality for the technical and non-technical teams committed to bringing their best visual stories to life \u2013 and we\u2019re just getting started.\u201d<\/p>\n\n\n\n<p><strong>New features bring ease and automation to visual media workflows<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li><strong>Generative Fill:<\/strong>&nbsp;Enables users to enhance and expand an image with ease. For example, users can intelligently expand and extend an original image, especially useful when needing to transform an image from vertical to horizontal. With generative fill, the new AI-generated background will blend seamlessly with the original image.<\/li><li><strong>Generative Remove:<\/strong>&nbsp;Via natural language prompts, users can remove unwanted elements from images and automatically add a matching background. The new feature uses state-of-the-art technology such as open-set object detection models and powerful AI capabilities through Stable Diffusion.<\/li><li><strong>Generative Replace:<\/strong>&nbsp;Allows users to easily detect, change and replace unwanted elements and colors all via natural-language prompts. This capability is especially useful for users looking to more easily create color-based variations of products or to improve web accessibility for those with color blindness.<\/li><li><strong>AI-powered Image Captioning:&nbsp;<\/strong>Intelligently creates image captions for galleries, user-generated content and product descriptions at scale. Image-to-text features strengthen image SEO, improve accessibility, and automate image classification for better findability. It also helps e-commerce users save time by automatically creating smart product descriptions.<\/li><li><strong>Conversational Transformations Builder:<\/strong>&nbsp;This intuitive feature provides a natural language interface through ChatGPT, allowing users to effortlessly communicate desired image transformations and optimizations. For example, a simple command such as \u201cplease blur this image and crop to a 1:1 aspect ratio\u201d would deliver a complete and correct transformation.<\/li><\/ul>\n\n\n\n<p><strong>AI at its core from the start<\/strong><\/p>\n\n\n\n<p>Cloudinary has a long history of delivering powerful AI capabilities to its customers via trusted industry-leading AI technologies such as OpenAI, Google Vision, and Amazon Rekognition, as well as its own domain expertise and content-aware machine learning models including those for background removal, smart image tagging, video cropping and domain-specific models for industries such as fashion and furniture. For more than a decade, Cloudinary\u2019s image and video solutions have leveraged AI and ML, offering the most advanced image and video transformations including face detection, contextual cropping and auto-tagging, and now generative AI capabilities and ChatGPT integrations for intelligent image captioning. With&nbsp;Cloudinary Assets, digital asset management is effortless through UI-based auto-tagging, AI-powered visual search, and content moderation, enabling seamless workflows for user-generated content. What\u2019s more, the Cloudinary Assets Studio feature harnesses generative AI power to make editing bulk assets simple and powerful.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Cloudinary, the image and video platform that powers many of the world\u2019s top brands, today announced the availability of several new generative AI, large language models (LLM) and GPT-based features within its&nbsp;Programmable Media image and video APIs&nbsp;including Generative Fill, Generative Remove, Generative Replace, AI-powered Image Captioning, and a ChatGPT-backed natural language interface. The new Cloudinary [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":19826,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_seopress_robots_primary_cat":"none","_seopress_titles_title":"","_seopress_titles_desc":"","_seopress_robots_index":"","footnotes":""},"categories":[2],"tags":[],"class_list":["post-19824","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news"],"rttpg_featured_image_url":{"full":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",150,150,false],"landscape":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",150,150,false],"portraits":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",150,150,false],"thumbnail":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png-150x150.jpg",150,150,true],"medium":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",150,150,false],"large":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",150,150,false],"1536x1536":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",150,150,false],"2048x2048":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",150,150,false],"post-thumbnail":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",150,150,false],"graptor-sq-xs":["https:\/\/web3unplugged.io\/blog\/wp-content\/uploads\/2023\/06\/cloudinary_cloud_glyph_blue_png.jpg",100,100,false]},"rttpg_author":{"display_name":"Admin CG","author_link":"https:\/\/web3unplugged.io\/blog\/author\/admin-cg\/"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/web3unplugged.io\/blog\/category\/news\/\" rel=\"category tag\">news<\/a>","rttpg_excerpt":"Cloudinary, the image and video platform that powers many of the world\u2019s top brands, today announced the availability of several new generative AI, large language models (LLM) and GPT-based features within its&nbsp;Programmable Media image and video APIs&nbsp;including Generative Fill, Generative Remove, Generative Replace, AI-powered Image Captioning, and a ChatGPT-backed natural language interface. The new Cloudinary&hellip;","_links":{"self":[{"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/posts\/19824","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/comments?post=19824"}],"version-history":[{"count":1,"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/posts\/19824\/revisions"}],"predecessor-version":[{"id":19827,"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/posts\/19824\/revisions\/19827"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/media\/19826"}],"wp:attachment":[{"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/media?parent=19824"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/categories?post=19824"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/web3unplugged.io\/blog\/wp-json\/wp\/v2\/tags?post=19824"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}