bitcoin
bitcoin

$106449.443371 USD

-0.66%

ethereum
ethereum

$3929.929826 USD

-4.01%

xrp
xrp

$2.607985 USD

5.66%

tether
tether

$0.999449 USD

-0.08%

solana
solana

$227.642593 USD

2.88%

bnb
bnb

$722.591686 USD

-0.87%

dogecoin
dogecoin

$0.398431 USD

-3.15%

usd-coin
usd-coin

$0.999907 USD

-0.01%

cardano
cardano

$1.078773 USD

-2.90%

tron
tron

$0.285221 USD

-3.86%

avalanche
avalanche

$48.915771 USD

-3.47%

chainlink
chainlink

$27.702254 USD

-7.83%

shiba-inu
shiba-inu

$0.000027 USD

-3.07%

toncoin
toncoin

$5.905827 USD

-4.18%

sui
sui

$4.634218 USD

-2.82%

加密货币新闻

Sora Turbo 和 Gemini 2.0 标志着创意产业和人工智能代理时代的关键时刻

2024/12/17 23:00

上周,OpenAI 推出了文本到视频生成人工智能 (AI) 工具“Sora”。该公司于 2024 年 2 月首次预览了该工具,但表示 Sora 的功能自那时以来已经不断发展,因此他们将最新版本称为“Sora Turbo”。

Sora Turbo 和 Gemini 2.0 标志着创意产业和人工智能代理时代的关键时刻

AI, Artificial Intelligence, ChatGPT, generative AI, Gemini 2.0, Google, Grok, OpenAI, Sora, text-to-video, X.

AI、人工智能、ChatGPT、生成式 AI、Gemini 2.0、Google、Grok、OpenAI、Sora、文本到视频、X.

Hot Topics in AI: OpenAI’s Sora text-to-video generative AI tool now available to some users

人工智能热门话题:OpenAI 的 Sora 文本到视频生成人工智能工具现已向部分用户开放

OpenAI recently launched its text-to-video generative artificial intelligence (AI) tool, codenamed “Sora.” While the company first previewed this tool in February 2024, they claim that Sora’s capabilities have evolved significantly since then, prompting them to dub the latest version “Sora Turbo.” Notably, OpenAI is currently making this tool available to ChatGPT Plus and Pro users.

OpenAI 最近推出了其文本到视频生成人工智能 (AI) 工具,代号为“Sora”。虽然该公司于 2024 年 2 月首次预览了该工具,但他们声称自那时以来,Sora 的功能已经发生了显着发展,促使他们将最新版本称为“Sora Turbo”。值得注意的是,OpenAI 目前正在向 ChatGPT Plus 和 Pro 用户提供此工具。

However, the company is heavily restricting access, leaving many people—including me—unable to try the tool immediately after its launch.

然而,该公司严格限制访问,导致包括我在内的许多人无法在该工具推出后立即尝试该工具。

For those with access, Sora Turbo allows users to create up to 50 videos per month at 480p resolution for GPT Plus subscribers, while GPT Pro subscribers can create up to 10x more videos at higher resolutions, including 1080p. Currently, videos max out at 20 seconds, and users can blend their assets with AI-generated elements or create entirely new content from text prompts.

对于有访问权限的用户,Sora Turbo 允许用户每月为 GPT Plus 订阅者创建最多 50 个 480p 分辨率的视频,而 GPT Pro 订阅者可以以更高分辨率(包括 1080p)创建最多 10 倍的视频。目前,视频时长最长为 20 秒,用户可以将其资源与 AI 生成的元素混合,或根据文本提示创建全新的内容。

The release of text-to-video tools like Sora marks a pivotal moment for the creative industry. On the one hand, businesses will likely leverage these tools to produce high-quality marketing materials quickly and at a fraction of the cost of traditional production methods. Similarly, independent creators can craft videos for personal projects or even indie films that rival professional studio output. These tools have the potential to level the field in video production and allow anyone with a creative vision to bring their ideas to life.

Sora 等文本转视频工具的发布标志着创意产业的关键时刻。一方面,企业可能会利用这些工具快速生产高质量的营销材料,而成本只是传统生产方法的一小部分。同样,独立创作者可以为个人项目制作视频,甚至可以与专业工作室的作品相媲美的独立电影。这些工具有潜力提升视频制作领域的水平,让任何有创意的人都能将他们的想法变成现实。

However, text-to-video AI also introduces new risks, including potential misuse in creating deepfakes. As these tools become more accessible, bad actors and trolls will try to exploit this technology to deceive others. We’ve already seen similar issues arise with text-to-image and text-to-audio AI, and text-to-video is poised to become the next attack vector.

然而,文本到视频的人工智能也带来了新的风险,包括在创建深度伪造品时可能被滥用。随着这些工具变得越来越容易使用,不良行为者和巨魔将试图利用这项技术来欺骗他人。我们已经看到文本转图像和文本转音频人工智能出现了类似的问题,而文本转视频有望成为下一个攻击媒介。

Google launches Gemini 2.0 and enters the era of AI Agents

谷歌推出Gemini 2.0,进入AI Agent时代

Last week, Google (NASDAQ: GOOGL) announced its latest AI innovation: Gemini 2.0, which the company describes as its most advanced multimodal model yet. In their official announcement, they even go as far as to say that Gemini 2.0 will usher in a new era of “agentic” AI, enabling the creation of autonomous agents designed to simplify everyday tasks.

上周,谷歌(纳斯达克股票代码:GOOGL)宣布了其最新的人工智能创新:Gemini 2.0,该公司将其描述为迄今为止最先进的多模式模型。在官方声明中,他们甚至表示 Gemini 2.0 将迎来“代理”人工智能的新时代,从而能够创建旨在简化日常任务的自主代理。

One of the first applications of Google’s AI agent powered by Gemini is Project Mariner, a Google Chrome extension currently in beta testing. Mariner acts as an AI-powered virtual assistant, capable of autonomously executing tasks like adding items to shopping carts, gathering information from multiple websites, and advising users on optimal strategies in games. However, to ensure safety and responsible use, Google says that Mariner will require a human somewhere in the operating loop at the moment, requiring user confirmation before the AI agent takes final action on things like making purchases.

由 Gemini 提供支持的 Google AI 代理的首批应用程序之一是 Project Mariner,这是一款目前正在进行 Beta 测试的 Google Chrome 扩展。 Mariner 充当人工智能驱动的虚拟助手,能够自主执行任务,例如将商品添加到购物车、从多个网站收集信息以及为用户提供游戏中的最佳策略建议。然而,为了确保安全和负责任的使用,谷歌表示,Mariner 目前需要有人参与操作循环,在人工智能代理对购买等事情采取最终行动之前需要用户确认。

Google’s announcement signals that AI agents are becoming a significant focus for the industry. Unlike chatbots, which serve as enhanced search engines, AI agents introduce an entirely new use case. These tools can perform complex tasks autonomously, but their adoption may require re-education for users unfamiliar with this type of technology. While chatbots like ChatGPT have become second nature for many and were rather intuitive for most, AI agents are something entirely different. There is no direct digital substitute for their functionality, and that gap may make adoption slower than expected.

谷歌的声明表明人工智能代理正在成为该行业的一个重要焦点。与充当增强型搜索引擎的聊天机器人不同,人工智能代理引入了一个全新的用例。这些工具可以自主执行复杂的任务,但它们的采用可能需要对不熟悉此类技术的用户进行重新教育。虽然像 ChatGPT 这样的聊天机器人已经成为许多人的第二天性,并且对大多数人来说相当直观,但人工智能代理却完全不同。它们的功能没有直接的数字替代品,这种差距可能会使采用速度慢于预期。

I see accessibility and ease of use as one of the biggest challenges for the average AI user. I feel that AI agent workflows will be so unfamiliar to this group that they will need some sort of training or education before they can dive into these systems.

我认为可访问性和易用性是普通人工智能用户面临的最大挑战之一。我觉得人工智能代理工作流程对于这个群体来说非常陌生,他们需要接受某种培训或教育才能深入了解这些系统。

X makes Grok free and adds Tweet analysis features

X 使 Grok 免费并添加了推文分析功能

Meanwhile, X (formerly Twitter) expanded access to its internal AI chatbot, Grok, by making it free for all users. Previously available only to X Premium subscribers, Grok has also received two more notable upgrades, including enhanced text-to-image generation and a new feature for analyzing tweets.

与此同时,X(前身为 Twitter)通过向所有用户免费开放其内部人工智能聊天机器人 Grok 的访问权限。 Grok 以前仅适用于 X Premium 订阅者,现在还获得了两项更显着的升级,包括增强的文本到图像生成和用于分析推文的新功能。

While its image generation capabilities are impressive, its text-to-text outputs fall short of industry standard. That being said, the Grok Analysis tool is the standout feature. It allows users to break down tweets into digestible summaries with context and links to related news or background information.

虽然其图像生成功能令人印象深刻,但其文本到文本的输出未达到行业标准。话虽这么说,Grok 分析工具是最突出的功能。它允许用户将推文分解为易于理解的摘要,其中包含上下文以及相关新闻或背景信息的链接。

While I find this tool useful, I still find it flawed. For instance, when I asked a follow-up question about an analyzed tweet, Grok seemed to “forget” the initial context, leading to fragmented conversations rather than a continuous dialogue from the first message sent.

虽然我发现这个工具很有用,但我仍然发现它有缺陷。例如,当我询问有关分析后的推文的后续问题时,Grok 似乎“忘记”了最初的上下文,导致对话支离破碎,而不是从发送的第一条消息开始进行连续对话。

Despite integrating into X’s ecosystem, Grok still lags behind leading AI chatbots like GPT-4, Claude 3.5, and Google’s Gemini 2.0.

尽管融入了 X 的生态系统,Grok 仍然落后于 GPT-4、Claude 3.5 和 Google Gemini 2.0 等领先的人工智能聊天机器人。

新闻来源:coingeek.com

免责声明:info@kdj.com

The information provided is not trading advice. kdj.com does not assume any responsibility for any investments made based on the information provided in this article. Cryptocurrencies are highly volatile and it is highly recommended that you invest with caution after thorough research!

If you believe that the content used on this website infringes your copyright, please contact us immediately (info@kdj.com) and we will delete it promptly.

2024年12月18日 发表的其他文章