translation

This is an AI translated post.

Byungchae Ryan Son

The 'Body' in the AI Age: How to View It

Select Language

  • English
  • 汉语
  • Español
  • Bahasa Indonesia
  • Português
  • Русский
  • 日本語
  • 한국어
  • Deutsch
  • Français
  • Italiano
  • Türkçe
  • Tiếng Việt
  • ไทย
  • Polski
  • Nederlands
  • हिन्दी
  • Magyar

Summarized by durumis AI

  • OpenAI has unveiled Sora, a new service that creates AI videos from text input, which can create realistic videos through abstraction from text and connection with parameter space.
  • While Sora still has limitations, OpenAI plans to create more realistic videos through 6 years of AI training, and as seen in The Monster's Origin, AI-generated images and videos will play an important role in social communication.
  • Especially with the phenomenon of the social consumption of decontextualized images, a change is needed from what to see to how to see, and as we discuss the future of technology, questions about the human body will become more complex and important.

Last weekend, Open AI released a new service called Sora. Sora allows you to create AI videos by entering text, and has been receiving rave reviews from reviewers since its release. The demo videos released through a few simple lines of text are as realistic as scenes planned, directed, and filmed in commercials or movies, such as slow-motion footage of a middle-aged man eating a hamburger, a low-angle shot capturing a Japanese street with two people walking, and a low-angle shot capturing a Japanese street with two people walking.


On the same day, Open AI expressed Sora as a world simulator in the technical report it released. Abstraction through text and its connection to the parameter space make this level of realistic abstraction possible, and will have a huge impact on the reasoning capabilities of future GPT models. The significance of this is enormous in terms of the influence of AI on human society.

https://play-tv.kakao.com/embed/player/cliplink/rv4bqfvi1rq49f96sh1gmfm02@my?service=daum_brunch&section=article&showcover=1&showinfo=0&extensions=0&rel=0

Prompt: Step-printing scene of a person running, cinematic film shot in 35mm.

However, Sora still has many limitations. For example, a person takes a bite out of a cookie, but the bite mark disappears afterward, and a person running on a treadmill runs with awkward arm and leg movements facing backwards, not forwards where the control panel is located. There are shortcomings in accurately understanding and showing spatial details within the user-provided prompt, changes occurring over time, and so on. These shortcomings in demonstrating cause and effect, and difficulties in implementing physics in complex scenes, are evidence that AI still does not fully understand the world. However, considering Open AI's 2023 announcement that it plans to train AI for another six years through its contract with Shutter Stock, which has 35 million high-resolution photos and videos, it is not difficult to expect that AI-generated images and videos will become even closer to the level of human-made content in the real world.


In his 2013 book "The Origins of Monsters," British archaeologist David Wengrow argues that from primitive Egypt to Mesopotamian civilization and the Mediterranean Iron Age, when cities were formed, civilizations flourished, and political and commercial networks expanded, there were no regions where visual images of fantastic and complex unreal creatures were not created and spread.

Images of fantastic creatures from the Pazyryk and Tuekta kurgans, South Russia


Graphic depictions of composite monsters, such as the griffin with a lion's body and an eagle's head and wings, and the Minotaur with a human body and a bull's head, were recorded in various murals and documents by people who lived in that era, and were conveyed along expanded trade routes for the sake of justifying the elite's status. The phenomenon of separating limbs or other features of various species and recombining them to create images of beings that correspond to the extended world that cannot be seen can be interpreted as a result of a change in society where the perspective of those who considered themselves "whole" centered on kinship relations expanded, leading to an increase in division of labor with people they had never met, and an awareness of themselves as "parts," which led to fear. In other words, one of the messages he wants to share is the discovery of the relationship between the creation of composite objects and self-awareness confirmed by technological advancements.


We have already seen countless interesting but grotesque images through AI image generation programs such as Midjourney and DALL-E since last year, such as a Roman-era statue of Spider-Man, an astronaut riding a horse on the moon's surface, and a robot painting with three arms, which go beyond the limits of the human body. The release of Sora will accelerate the spread of AI-generated videos even more. What is important is that the depiction and social transmission of monsters, i.e. composite objects, which are repeated along with the records of early human history, are being reproduced in the age of generative AI. This trend can be described as a "vibe" that envelops the world of this era, which is difficult to capture in a simple expression like trend.Although this expression is not new, we need to take this trend more seriously because we seem to be at a remarkable inflection point as we approach the AI era.


Long-form argumentation and explanation are impossible, and the most consumed and addictive YouTube and TikTok short videos prove the social consumption of decontextualized images.Momentary and associative poetic images have become increasingly important, which means that existing perspectives on interpretation related to what is considered information have been overturned.


In short, we need a change from what to see to how to see.


We need to choose a more semantic and analytical approach to trends and phenomena related to the body. It is an era where the philosophy of lingerie brands that confidently express a variety of bodies as they are is recognized instead of the aesthetics of the model legions in Guess and Calvin Klein marketing campaigns, and an image of pink lipstick above lips that does not look like a man or a woman becomes the embodiment of an effective strategy to approach teenagers. When imagining the future of technology and discussing technological advancements, the human body always exists alongside it. The question of how to view the human body will become even more complex and important in the future.


References



Byungchae Ryan Son
Byungchae Ryan Son
찾아가 관찰하고 경청하는 일을 합니다.
Byungchae Ryan Son
The Age of Monsters, Humans Within Through the book 'The Origin of Monsters', we learned that the monster image was used to symbolize power and authority in primitive civilization. The author argues that this phenomenon is due to cognitive, socioeconomic, and institutional factors. How is

May 22, 2024

The World Infused with Generative AI: Discord Theory and Process The advancement of generative AI technology raises new questions about the competitiveness of creators in the fields of art, advertising, and media. This article emphasizes that in a rapidly changing digital environment, creators need to emphasize rich pe

May 9, 2024

The Meaning of Human in 2023, After the Rise of Generative AI 2023 was a year of significant changes driven by AI's dominance across the technology industry, including a shift towards generative AI platforms, increased public familiarity, and the agreement on AI regulatory bills. In 2024, alongside advancements in

May 16, 2024

Image Generation AI: The Revolutionary Technology and Challenges in Reality Image generation AI is a revolutionary technology that creates realistic images from text input alone. It is used in a variety of fields such as art, design, and marketing, but problems such as intellectual property rights, bias, and fake image generation
durumis AI News Japan
durumis AI News Japan
durumis AI News Japan
durumis AI News Japan

May 6, 2024

Mr. Know-All – 2023.7 The first issue of "Mr. Know-All," a monthly AI magazine in July 2023, introduces the latest AI technologies and trends, including Claude 2, Azure OpenAI, LangChain, and LlamaIndex. In particular, it provides a detailed explanation of LlamaIndex, which em
Pilot AISmrteasy
Pilot AISmrteasy
Pilot AISmrteasy
Pilot AISmrteasy

March 21, 2024

My personal perspective on the near future AI will bring about significant changes in the future of developers. While AI will automate simple tasks such as prototyping and front-end development, skilled developers will focus on strategic problem-solving and creative idea generation. On the other h
Mark Ball Blog
Mark Ball Blog
Image about my personal vision of the future
Mark Ball Blog
Mark Ball Blog

March 26, 2024

What is LLM (Large Language Model)? Large Language Models (LLMs) are a core technology in artificial intelligence, learning from vast amounts of text data to acquire language processing abilities similar to humans. They can be used in various fields such as chatbots, translation, and text g
세상 모든 정보
세상 모든 정보
세상 모든 정보
세상 모든 정보

April 1, 2024

"MetaTopia", a Metaverse company with solid references MetaTopia is a company established by KAIST graduates, awarded the Minister of Science and ICT Award, that provides 3D metaverse space design and construction services. It has various references including Seoul Station, Seoul Innovation Park, Newsroom Bro
Nahee Noh
Nahee Noh
Nahee Noh
Nahee Noh

March 20, 2024

The Dawn of the AI Age: Harmonizing Technological Innovation with Ethics Google and OpenAI have showcased the groundbreaking advancements in AI technology by unveiling their new AI models 'Gemini' and 'GPT-4'. Gemini can process information from various modalities, summarize information, and provide answers to queries. It is s
durumis AI News Japan
durumis AI News Japan
durumis AI News Japan
durumis AI News Japan

May 18, 2024