{"id":139450,"date":"2025-05-27T00:21:45","date_gmt":"2025-05-27T00:21:45","guid":{"rendered":"https:\/\/teknomers.com\/en\/bagel-the-creator-of-tiktok-unveils-a-promising-new-multimodal-ai\/"},"modified":"2025-05-27T00:21:46","modified_gmt":"2025-05-27T00:21:46","slug":"bagel-the-creator-of-tiktok-unveils-a-promising-new-multimodal-ai","status":"publish","type":"post","link":"https:\/\/teknomers.com\/en\/bagel-the-creator-of-tiktok-unveils-a-promising-new-multimodal-ai\/","title":{"rendered":"BAGEL: The creator of TikTok unveils a promising new multimodal AI."},"content":{"rendered":"\n<h1>The Rise of ByteDance&#8217;s BAGEL: A Game Changer in AI<\/h1>\n<p>In recent developments, <strong>ByteDance<\/strong>, the parent company of the widely popular social media platform <strong>TikTok<\/strong>, has entered the race for artificial intelligence (AI) innovation with the unveiling of its new model, <strong>BAGEL<\/strong>. This advancement is set to bring significant changes to how we interact with digital content, as it aims to combine multiple modalities\u2014text, images, and videos\u2014into a single, versatile AI platform.<\/p>\n<h2>\n<h2>Understanding BAGEL\u2019s Architecture<\/h2>\n<\/h2>\n<p>At the core of BAGEL lies a groundbreaking architecture known as <strong>Mixture-of-Transformer-Experts (MoT)<\/strong>. This innovative framework is designed to handle and process different types of data concurrently. BAGEL employs <strong>two distinct encoders<\/strong>: one focuses on pixel-level details, while the other captures the <strong>semantic dimensions<\/strong> of visuals. <\/p>\n<p>The model has been trained on billions of <strong>multimodal tokens<\/strong> interspersed with <strong>next group of token prediction<\/strong> paradigms. This allows BAGEL to generate or complete text, images, and video sequences without needing to switch architectures, thus showcasing its <strong>adaptability<\/strong> and <strong>efficiency<\/strong>.<\/p>\n<h2>\n<h2>Proven Results: Early Achievements<\/h2>\n<\/h2>\n<p>According to the initial reports from ByteDance, BAGEL has exhibited impressive performance metrics. For instance, on the <strong>GAIA benchmark<\/strong>, BAGEL received an outstanding score of <strong>82.42<\/strong>, surpassing other advanced models like <strong>Qwen2.5-VL<\/strong> and <strong>InternVL-2.5<\/strong>. In other assessments, such as <strong>MME (2388)<\/strong>, <strong>MMBench (85.0)<\/strong>, and <strong>MM-Vet (67.2)<\/strong>, BAGEL has outperformed leading open-source models of comparable size.<\/p>\n<p>In addition, when tested on text-to-image generation using <strong>GenEval<\/strong>, BAGEL scored <strong>0.88<\/strong>, placing it in close proximity to the industry-standard <strong>Stable Diffusion 3<\/strong>. Furthermore, its <strong>image editing capabilities<\/strong> have shown promise, with indicators like <strong>GEdit-Bench-EN<\/strong> scoring <strong>7.36<\/strong>, confirming its potential for fine visual manipulation right from its initial public release.<\/p>\n<h2>\n<h2>Operational Functions of BAGEL<\/h2>\n<\/h2>\n<p>BAGEL&#8217;s capabilities extend beyond merely transcribing images. It can generate <strong>4K visuals<\/strong> from descriptive text, predict future frames in videos, and even transform the style of photographs. Its creators emphasize the model&#8217;s <strong>integrated reasoning chain<\/strong>, enabling it to articulate its logic across multiple dialog turns. This feature proves particularly useful in <strong>3D navigation<\/strong> and in analyzing complex documents, making BAGEL versatile across various applications.<\/p>\n<h2>\n<h2>Efficiency and Cost-Effectiveness<\/h2>\n<\/h2>\n<p>One of the standout features of BAGEL is its efficiency. By utilizing only <strong>7 billion active parameters<\/strong>, the model significantly reduces inference costs\u2014reportedly by about <strong>40%<\/strong> when contrasted with similarly sized dense models. Internal tests have indicated that BAGEL can generate a <strong>\u201ccyberpunk\u201d<\/strong> image in just <strong>three seconds<\/strong>, achieving a <strong>15% gain in fidelity<\/strong> measured by <strong>SSIM<\/strong>, an industry-standard metric for assessing the similarity between two digital images.<\/p>\n<p>Moreover, it&#8217;s noteworthy that BAGEL can operate on a single <strong>Nvidia A100 GPU<\/strong>, lowering barriers for local exploitation by independent laboratories and studios. While it remains to be seen if these promises will hold in practical applications, the implications for creativity and academia could be profound, if BAGEL proves to be as efficient as projected. Within just twenty-four hours of its launch, the model garnered <strong>50,000 visits<\/strong> on <strong>Hugging Face<\/strong> and already accumulated <strong>3,000 stars<\/strong> on <strong>GitHub<\/strong>.<\/p>\n<h2>\n<h2>ByteDance&#8217;s Strategic Vision<\/h2>\n<\/h2>\n<p>The excitement around BAGEL is compounded by its origin within ByteDance, a company known for its innovative approaches in the social media landscape. This connection gives BAGEL a unique status, as it\u2019s plausible that ByteDance might integrate this AI model into its existing platforms, including TikTok. This could lead to transformative experiences for users, making content creation even more seamless and intuitive.<\/p>\n<h2>\n<h2>The Future of AI with BAGEL<\/h2>\n<\/h2>\n<p>As technological advances continue to propel the capabilities of AI, models like BAGEL signify a leap towards comprehensive multimodal integration. The versatility that this model offers could reshape various sectors, including entertainment, education, and professional services. Understanding how to utilize such tools effectively will be essential in maximizing their potential benefits, paving the way for innovative applications that could redefine our interaction with content across multiple formats.<\/p>\n<p>In conclusion, the entrance of ByteDance into the AI domain with models like BAGEL opens up myriad possibilities for creative and practical applications, potentially setting new standards in the industry. As we look forward to increasingly sophisticated AI solutions, the early evidence of BAGEL\u2019s capabilities suggests that we are on the cusp of a significant transformation in how content is generated and experienced.<\/p>\n<div>\n<p class=\"ed__a-p ed__bdy__l\">Apr\u00e8s <a href=\"https:\/\/www.lesnumeriques.com\/intelligence-artificielle\/baidu-le-google-chinois-court-apres-deepseek-et-chatgpt-avec-deux-nouvelles-ia-agressives-n234266.html\" rel=\"nofollow noopener\" target=\"_blank\">Baidu<\/a>, <a href=\"https:\/\/www.lesnumeriques.com\/intelligence-artificielle\/qu-est-ce-que-deepseek-la-reponse-chinoise-a-chatgpt-a231842.html\" rel=\"nofollow noopener\" target=\"_blank\">DeepSeek<\/a> et <a href=\"https:\/\/www.lesnumeriques.com\/intelligence-artificielle\/alibaba-wan-2-1-la-nouvelle-ia-star-pour-generer-des-photos-et-videos-n233600.html\" rel=\"nofollow noopener\" target=\"_blank\">Alibaba<\/a>, c&#8217;est au tour d&#8217;un autre g\u00e9ant chinois, ByteDance, de se m\u00ealer \u00e0 la course \u00e0 l&#8217;IA. Si le nom de cette entreprise ne vous dit rien, sa cr\u00e9ation majeure ne vous ne sera sans doute pas \u00e9trang\u00e8re, puisqu&#8217;il s&#8217;agit de TikTok. Il y a quelques jours, ByteDance a d\u00e9voil\u00e9 BAGEL, un mod\u00e8le pr\u00e9sent\u00e9 comme g\u00e9n\u00e9raliste, avec 7 milliards de param\u00e8tres actifs \u2014 14 milliards au total. Il est capables d\u2019ing\u00e9rer texte, images ou vid\u00e9os, puis de r\u00e9pondre dans l\u2019un ou l\u2019autre format sans avoir \u00e0 changer d\u2019architecture. Dans la foul\u00e9e, ByteDance a plac\u00e9 le code, les poids ainsi que la documentation sous licence Apache 2.0, confirmant une strat\u00e9gie clairement tourn\u00e9e vers l\u2019ouverture. Vous pouvez d&#8217;ailleurs tester l&#8217;ensemble via <a href=\"https:\/\/demo.bagel-ai.org\/\" target=\"_blank\" rel=\"nofollow noopener\">cette interface de d\u00e9mo<\/a>.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/teknomers.com\/category\/general\/\" rel=\"dofollow\">General News &#8211; 2<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Rise of ByteDance&#8217;s BAGEL: A Game Changer in AI In recent developments, ByteDance, the parent company of the widely popular social media platform TikTok, has entered the race for artificial intelligence (AI) innovation with the unveiling of its new model, BAGEL. This advancement is set to bring significant changes to how we interact with [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":139451,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[36399],"tags":[],"class_list":["post-139450","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/posts\/139450","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/comments?post=139450"}],"version-history":[{"count":0,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/posts\/139450\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/media\/139451"}],"wp:attachment":[{"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/media?parent=139450"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/categories?post=139450"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/tags?post=139450"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}