{"id":17750,"date":"2024-10-15T15:07:11","date_gmt":"2024-10-15T13:07:11","guid":{"rendered":"https:\/\/kingkong-mag.com\/?p=17750"},"modified":"2024-10-22T10:29:51","modified_gmt":"2024-10-22T08:29:51","slug":"choosing-an-ai-model","status":"publish","type":"post","link":"https:\/\/kingkong-mag.com\/en\/choosing-an-ai-model\/","title":{"rendered":"Choosing an AI model, the artists\u2019 conundrum (2\/2)"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">The effects of the dataset<\/p>\n\n\n<p class=\"lead\">\r\nDespite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies. \r\n<\/p>\n\n\n<p class=\"wp-block-paragraph\"><strong><em>This article is a republication. The original was published on <a href=\"https:\/\/hacnum.org\/hacnumedia-articles\/\" target=\"_blank\" rel=\"noreferrer noopener\">HACNUMedia<\/a> (the media that explores the connections between technology and creativity), a partner of kingkong.<\/em><\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Before getting into the crux of the subject, it is important to understand the utility of a set of data and revisit several technical aspects. A dataset consists of data (numbers, texts, images) and serves to train algorithms. Its main function is therefore to provide the algorithm with a diversity of examples in order for it to learn to recognise patterns, take decisions and make predictions. In other words, the dataset is indispensable for the AI systems which are the Large Language Models (LLMs) designed to process and generate text (ChatGPT, Google Gemini, LLaMA and Claude) or the image generators (Midjourney, Stable Diffusion, DALL-E). These models enabling texts or images to be generated are trained on enormous quantities of data found on websites and social networks (the most widely known technique is called web scraping), collected with the \u2013 more or less informed \u2013 consent of internet users. Let us first of all focus on the data used to train the LLMs and the image generators.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"1920\" height=\"1080\" src=\"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2024\/10\/Quentin-Sombsthay_Image-Latente_2.png\" alt=\"\" class=\"wp-image-17735\"\/><figcaption class=\"wp-element-caption\">\u00a9 Quentin Sombsthay<\/figcaption><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\">A mechanics of standardisation<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">This data, whilst substantial, remains generic: it to some extent forms a large scan of the internet and can lead to a standardisation of the responses generated. Conceptually, \u2018<em>with these models based on Big Data, you get a kind of photograph of the internet\u2019s collective unconscious. It\u2019s interesting, but the motor of creativity rests on constraint, for example on a restricted dataset<\/em>,\u2019 notes the artist <a href=\"https:\/\/justineemard.com\/works\/\" target=\"_blank\" rel=\"noreferrer noopener\">Justine Emard<\/a>, who has turned AI into the guiding thread of his work. Over time, this globalised dataset could have an unintended effect: the datasets gleaned from web scraping or web crawling (an indexing technique to automatically explore the Web) progressively include content supplied by generative AI models, thus contaminating existing models. That may trigger a cannibalisation of aesthetics effect. An hypothesis which is far from having been verified, given the way the LLMs continually evolve.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Likewise, the question of biases reinforcing racism or sexism, for example, is raised in the construction of these datasets. An aspect which should not be overlooked, but one which deserves being examined more deeply. The artist Gr\u00e9gory Chatonsky precisely shares his expertise in an article published on his website: \u2018<em>Statistical induction is criticized for its propensity to highlight certain points of view [\u2026] What is required in return? An absence of bias? Other biases? A transparency and a readability without any remnants of these biases? If AI is being fantastically transformed into an autonomous person with a personality, it should be viewed as a new way of browsing and consulting a library<\/em>.\u2019 To put it differently, let us not be blinded by subjectivity, nor na\u00efve in this desire for objectivity and neutrality. \u2018<em>Criticism sets aside the historical hermeneutics inherent in all reading, and delegates even further to AI\u2019s automatisms what should constitute our faculty of reflexivity. Criticism, as is often the case, reproduces what it believes it is contesting. By staging AI\u2019s power of truth, it institutes it<\/em>.\u2019<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Creating a made-to-measure dataset<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Certain artists choose to build their own datasets in order to preserve their singularity, to cultivate their subjectivity. Isma\u00ebl Joffroy Chandoutis, an artist working at the intersection between contemporary art and cinema (read the article published on HACNUMedia) is an AI and deepfake specialist. He outlines several methods for integrating a dataset. For example, for an LLM, \u2018it is possible to \u2018<em>create your dataset by pasting texts in the chat window. In this case, you must respect the word limit which the model can process, what is called the context-length token. If you need more, you can use a system which searches for additional information via RAG (retrieval-augmentation generation). That entails the building of an external database on a local or online server<\/em>.\u2019 Whilst the techniques for dataset creation differ again when it involves the generation of images, videos or sound, the issue remains crucial in every instance.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The artist Bruno Ribeiro, the author of several works produced by means of AI such as<a href=\"https:\/\/www.polydactylie.com\/\" target=\"_blank\" rel=\"noreferrer noopener\"> Polydactylie\u00a0<\/a>and\u00a0<a href=\"https:\/\/ribeirobruno.com\/CELLULO-D\" target=\"_blank\" rel=\"noreferrer noopener\">CELLULOD\/D<\/a>, sees in this a way of being \u2018<em>independent and unique. My work MOTION <\/em>(Editor\u2019s note: presented at the Metahaus from October 18 to 25, 2024<em>) which is an homage to the galloping horse of Eadweard Muybridge was produced on the basis of images from other films. I wanted it to be images which I knew, which I had chosen<\/em>.\u2019 While the machine enables one to go beyond what the eye cannot see, here the subjectivity of the artist remains at the centre of the approach adopted. Justine Emard shares his thought process during the creation of <a href=\"https:\/\/justineemard.com\/hyperphantasia\/\" target=\"_blank\" rel=\"noreferrer noopener\">Hyperphantasia<\/a>. For this work, an AI in Machine Learning (other than a LLM) was trained on a scientific database of the Chauvet Pont-d\u2019Arc cave in order to manufacture new images. \u2018<em>I didn\u2019t want to enter into a fantasy of prehistory. I wanted to stay within a form of abstraction. I therefore worked with the archaeologist Jean-Michel Geneste on a restricted dataset based on several thousand raw images. We selected and augmented them in an intelligent manner. With a generic database, the result would have been totally different<\/em>.\u2019\u00a0<\/p>\n\n\n\n<figure class=\"wp-block-video\"><video controls src=\"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2024\/10\/MOTION.mp4\"><\/video><figcaption class=\"wp-element-caption\">MOTION \u2013 Bruno Ribeiro<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Although popular belief would have it that AI generates content instantaneously (and that it is therefore synonymous with saving time), it should be made clear that the building up of datasets is a long-term undertaking. \u2018<em>The establishing of datasets demands time. When you get started on the work you think it is quicker but everything takes longer<\/em>,\u2019 warns Bruno Ribeiro. And there are numerous stages: selecting the data, data pre-processing (cleaning), dividing the dataset into training data and test data (verifying the quality of the model), training, readjustment, assessment, improvement, etc<em>.<\/em> \u2018<em>The training takes several hours but the upstream and downstream phases can take months. It is also necessary to take the time to view hundreds of images created. It is a multi-layered process which is not instantaneous, unlike a prompt which generates an immediate image<\/em>,\u2019 testifies Justine Emard.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Environmental and social impacts<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">The dataset issue can also be examined from an environmental and social perspective. The LLMs rely on colossal datasets and require considerable computational power and resources. \u2018<em>This takes us back to the materiality of the digital. The data is stored on gigantic servers which need significant energy resources. The production of electricity, but also semi-conductors which entail the massive extraction of silicon and rare earths<\/em>,\u2019 explains Isma\u00ebl Joffroy Chandoutis. The data is treated by processors (GPU; Graphics Processing Units) carrying out AI calculations as well as video and graphics rendering. A model like ChatGPT makes use of several hundreds of thousands of GPUs for each training. Whilst no official announcement has been forthcoming, several sources estimate that the training of GPT-4o probably required at least 25,000 high performance GPUs over several months. \u2018<em>The NVIDIA H100 models are widely used today. Chips such as those contained in Neural Engine (Apple) look to optimise their impact by being more specialised and focusing solely on AI calculations. Despite everything, even though the environmental cost of these processors is being reduced, the levels remain astronomical<\/em>,\u2019 adds Isma\u00ebl Joffroy Chandoutis.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Otherwise, it is important to remind ourselves that numerous AI models, including those termed \u2018unsupervised\u2019, are calibrated thanks to human intervention. These \u2018clickworkers\u2019, often based in Madagascar or South-East Asia, are given the task of \u2018<em>annotating texts or images, in order to construct the learning corpus, for example by indicating on the photo of a crossroads which are the road signs, or by identifying the traces of rust on telegraph posts, or by noticing if a customer is in the act of stealing in a shop<\/em>,\u2019 explains the sociologist Cl\u00e9ment Le Ludec in \u2018Le Monde\u2019. \u2018<em>Even what is termed generative AI is affected. ChatGPT required many annotations to teach the programme what an acceptable response is or isn\u2019t, depending on a certain scale of values. In our database of companies making use of these human tasks, a third belong to the natural language processing sector<\/em>.\u2019 The work currently being created by <a href=\"https:\/\/www.scam.fr\/actualites-ressources\/quentin-sombsthay-laureat-du-prix-emergences-2023\/\">Quentin Sombsthay<\/a>, \u2018Latente Image\u2019 (2023 SCAM \u00e9mergences Prize), precisely retraces the post-traumatic stress disorders experienced by these clickworkers based in Nairobi in Kenya. Paid 2$ per hour, they manually sort ultra-violent content in order to develop the censorship practiced by Chat GPT.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"1920\" height=\"1080\" src=\"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2024\/10\/Quentin-Sombsthay_Image-Latente.png\" alt=\"\" class=\"wp-image-17737\"\/><figcaption class=\"wp-element-caption\">\u00a9 Quentin Sombsthay<\/figcaption><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Training of data locally<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Here again, the artists can minimise their environmental impact. \u2018<em>It\u2019s an assessment each person has to make. There are compromises to be made between artistic coherence, financial resources and personal ethics. Personally, I wanted to work on a local server which redistributes the heat<\/em>,\u2019 shares Justine Emard. Training on local servers is on the other hand less compatible with the LLMs. \u2018<em>Technically, it\u2019s possible, but that would require very powerful computers, which few people have access to, to process all of this data, otherwise the experience would be marred<\/em>,\u2019 points out Isma\u00ebl Joffroy Chandoutis. Hence the current enthusiasm in the world of Tech for SLM (Small Language Models). The difference between a Small Language Model and a Large Language Model principally lies in the size of their architecture, their computing capacity and, certainly, in the quantity of the training data. \u2018<em>We are progressively moving towards a cohabitation between the LLM and SLM models. For example, the strategy employed by Apple is to pivot their AI to the latest iPhones, in other words to local storage and with few watts<\/em>,\u2019 he adds.\u00a0\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The training of data can also be optimised (finetuning) via the des LoRAs (Low Rank Adaptation) technique which allows large size models to be finetuned. Nevertheless, it is worthwhile remembering that that the LLMs and the image generators are not the sole models on the market: there are other less advanced models based on Machine Learning, for example, and which may be all the better suited. Marc Chemillier, the Director of Studies at the Paris-based EHESS (School for Advanced Studies in Social Sciences), is the co-creator (together with other IRCAM (Institute for Research and Coordination in Acoustics\/Music) researchers) of Djazz. \u2018<em>With the type of AI we use, the resources are very limited. Our model is not based on deep learning, it is a transition probability model. We can do impressive things with low quantities of data and little equipment. You just need to have a microphone and a computer. Then the software captures a musical flow and learns to play like it. It is an agnostic model without a particular rhythmic signature, just a regular beat which organises the data. The musical knowledge is in the flow we capture, then the AI creates an improvisation<\/em>.\u2019<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Data subject to copyright<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Finally, the sources of the data have been the subject of heated exchanges on the issue of copyright. Michal Seta, a creative technologist at Lab 148 in Montreal, sums up the debate: \u2018<em>there are several aspects to be analysed: the rights to the model<\/em> (<a href=\"https:\/\/hacnum.org\/hacnumedia\/choisir-un-modele-dia-le-casse-tete-des-artistes-1-2\/\" target=\"_blank\" rel=\"noreferrer noopener\">read the article published on HACNUMedia<\/a>) <em>but also the rights to the data which serves to train the AI and the protection of the production of a work generated by AI<\/em>. <em>The issue is one of knowing where the training data comes from. Models such as ChatGPT are trained on the basis of online media, Wikipedia, content published on social networks. These big companies are completely opaque concerning their dataset<\/em>.\u2019 In a similar vein, \u2018<em>the data sent in ChatGPT is used to train OpenAI. There is both a problem of confidentiality and one of consent<\/em>.\u2019 Recently an <a href=\"https:\/\/hacnum.org\/hacnumedia\/ia-droits-dauteur-finalement-qui-est-lartiste\/\" target=\"_blank\" rel=\"noreferrer noopener\">article published on HACNUMedia<\/a> raised the question in these terms: \u2018can the artists whose work has been provided to an AI be considered co-authors?\u2019 If the recent publication of the <a href=\"https:\/\/artificialintelligenceact.eu\/fr\/\" target=\"_blank\" rel=\"noreferrer noopener\">AI ACT<\/a> attempts to come up with responses, in particular with the obligation for these generative AIs to publish a detailed summary of the sources used for training, there is little room for manoeuvre. The anti-AI Watermarks (information subtly included within an image, a text or a video and used to protect creations against unauthorised use), or other projects such as the website\u00a0 <a href=\"https:\/\/haveibeentrained.com\/about\" target=\"_blank\" rel=\"noreferrer noopener\">HaveIBeenTrained<\/a> which enable artists to have their images withdrawn from the databases of LLMs such as Stable Diffusion, are valuable but in the end have little impact. In this game of cat and mouse, the artists finally come up against an irrefutable reality: data is well and truly the black gold of the 21<sup>st<\/sup> century.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The effects of the dataset This article is a republication. The original was published on HACNUMedia (the media that explores the connections between technology and creativity), a partner of kingkong. Before getting into the crux of the subject, it is important to understand the utility of a set of data and revisit several technical aspects. [&hellip;]<\/p>\n","protected":false},"author":25,"featured_media":17742,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"wds_primary_category":0,"wds_primary_type_article":0,"footnotes":""},"categories":[99,109,101],"tags":[126],"type_article":[56],"class_list":["post-17750","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-art-en","category-digital-en","category-innovation-en","tag-ai","type_article-article-en"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Choosing an AI model, the artists\u2019 conundrum (2\/2) - kingkong<\/title>\n<meta name=\"description\" content=\"Despite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Choosing an AI model, the artists\u2019 conundrum (2\/2)\" \/>\n<meta property=\"og:description\" content=\"Despite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/\" \/>\n<meta property=\"og:site_name\" content=\"kingkong\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/kingkong.be\" \/>\n<meta property=\"article:published_time\" content=\"2024-10-15T13:07:11+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-10-22T08:29:51+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2024\/10\/MOTION-660x630.png\" \/>\n\t<meta property=\"og:image:width\" content=\"660\" \/>\n\t<meta property=\"og:image:height\" content=\"630\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Adrien Cornelissen\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:title\" content=\"Choosing an AI model, the artists\u2019 conundrum (2\/2)\" \/>\n<meta name=\"twitter:description\" content=\"Despite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies.\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2024\/10\/MOTION.png\" \/>\n<meta name=\"twitter:creator\" content=\"@kingkong_kikk\" \/>\n<meta name=\"twitter:site\" content=\"@kingkong_kikk\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Adrien Cornelissen\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"12 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/\"},\"author\":{\"name\":\"Adrien Cornelissen\",\"@id\":\"https:\/\/kingkong-mag.com\/#\/schema\/person\/1dcf567c6938f5e019824f817b17b3bc\"},\"headline\":\"Choosing an AI model, the artists\u2019 conundrum (2\/2)\",\"datePublished\":\"2024-10-15T13:07:11+00:00\",\"dateModified\":\"2024-10-22T08:29:51+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/\"},\"wordCount\":2091,\"publisher\":{\"@id\":\"https:\/\/kingkong-mag.com\/#organization\"},\"keywords\":[\"AI\"],\"articleSection\":[\"Art\",\"Digital\",\"Innovation\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/\",\"url\":\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/\",\"name\":\"Choosing an AI model, the artists\u2019 conundrum (2\/2) - kingkong\",\"isPartOf\":{\"@id\":\"https:\/\/kingkong-mag.com\/#website\"},\"datePublished\":\"2024-10-15T13:07:11+00:00\",\"dateModified\":\"2024-10-22T08:29:51+00:00\",\"description\":\"Despite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies.\",\"breadcrumb\":{\"@id\":\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Accueil\",\"item\":\"https:\/\/kingkong-mag.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Choosing an AI model, the artists\u2019 conundrum (2\/2)\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/kingkong-mag.com\/#website\",\"url\":\"https:\/\/kingkong-mag.com\/\",\"name\":\"kingkong\",\"description\":\"Creative Culture Media\",\"publisher\":{\"@id\":\"https:\/\/kingkong-mag.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/kingkong-mag.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/kingkong-mag.com\/#organization\",\"name\":\"kingkong\",\"alternateName\":\"kikk asbl\",\"url\":\"https:\/\/kingkong-mag.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/kingkong-mag.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2022\/11\/kikk_asbl-copie.png\",\"contentUrl\":\"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2022\/11\/kikk_asbl-copie.png\",\"width\":916,\"height\":367,\"caption\":\"kingkong\"},\"image\":{\"@id\":\"https:\/\/kingkong-mag.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/kingkong.be\",\"https:\/\/twitter.com\/kingkong_kikk\",\"https:\/\/www.tiktok.com\/@kingkong.kikk\",\"https:\/\/www.instagram.com\/kingkong.kikk\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/kingkong-mag.com\/#\/schema\/person\/1dcf567c6938f5e019824f817b17b3bc\",\"name\":\"Adrien Cornelissen\",\"description\":\"Au fil de ses exp\u00e9riences, Adrien Cornelissen a d\u00e9velopp\u00e9 une expertise sur les probl\u00e9matiques li\u00e9es \u00e0 l'innovation et la cr\u00e9ation num\u00e9rique. Il a collabor\u00e9 avec une dizaine de magazines fran\u00e7ais dont Fisheye Immersive, XRMust, Usbek &amp; Rica, Nectart ou la Revue AS. Il coordonne HACNUMedia qui explore les mutations engendr\u00e9es par les technologies dans la cr\u00e9ation contemporaine. Adrien Cornelissen intervient dans des \u00e9tablissements d\u2019enseignement sup\u00e9rieur et des structures de la cr\u00e9ation.\",\"sameAs\":[\"https:\/\/www.linkedin.com\/in\/adrien-cornelissen-435810135\/\"],\"url\":\"https:\/\/kingkong-mag.com\/en\/author\/adrien-cornelissen\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Choosing an AI model, the artists\u2019 conundrum (2\/2) - kingkong","description":"Despite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/","og_locale":"en_US","og_type":"article","og_title":"Choosing an AI model, the artists\u2019 conundrum (2\/2)","og_description":"Despite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies.","og_url":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/","og_site_name":"kingkong","article_publisher":"https:\/\/www.facebook.com\/kingkong.be","article_published_time":"2024-10-15T13:07:11+00:00","article_modified_time":"2024-10-22T08:29:51+00:00","og_image":[{"width":660,"height":630,"url":"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2024\/10\/MOTION-660x630.png","type":"image\/png"}],"author":"Adrien Cornelissen","twitter_card":"summary_large_image","twitter_title":"Choosing an AI model, the artists\u2019 conundrum (2\/2)","twitter_description":"Despite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies.","twitter_image":"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2024\/10\/MOTION.png","twitter_creator":"@kingkong_kikk","twitter_site":"@kingkong_kikk","twitter_misc":{"Written by":"Adrien Cornelissen","Est. reading time":"12 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/#article","isPartOf":{"@id":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/"},"author":{"name":"Adrien Cornelissen","@id":"https:\/\/kingkong-mag.com\/#\/schema\/person\/1dcf567c6938f5e019824f817b17b3bc"},"headline":"Choosing an AI model, the artists\u2019 conundrum (2\/2)","datePublished":"2024-10-15T13:07:11+00:00","dateModified":"2024-10-22T08:29:51+00:00","mainEntityOfPage":{"@id":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/"},"wordCount":2091,"publisher":{"@id":"https:\/\/kingkong-mag.com\/#organization"},"keywords":["AI"],"articleSection":["Art","Digital","Innovation"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/","url":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/","name":"Choosing an AI model, the artists\u2019 conundrum (2\/2) - kingkong","isPartOf":{"@id":"https:\/\/kingkong-mag.com\/#website"},"datePublished":"2024-10-15T13:07:11+00:00","dateModified":"2024-10-22T08:29:51+00:00","description":"Despite the proliferation of generative artificial intelligence models, many artists still have a restricted view of the tools on the market and their impact. Having already looked into the issues of proprietary, free and open source models, this second part breaks down an essential point of these models: the dataset. Its makeup and the nature of the data have varied repercussions on the standardisation of aesthetics, on the environmental impacts or on the questions relative to copyright. An analysis based on expert testimonies.","breadcrumb":{"@id":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/kingkong-mag.com\/choisir-un-modele-dia-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Accueil","item":"https:\/\/kingkong-mag.com\/"},{"@type":"ListItem","position":2,"name":"Choosing an AI model, the artists\u2019 conundrum (2\/2)"}]},{"@type":"WebSite","@id":"https:\/\/kingkong-mag.com\/#website","url":"https:\/\/kingkong-mag.com\/","name":"kingkong","description":"Creative Culture Media","publisher":{"@id":"https:\/\/kingkong-mag.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/kingkong-mag.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/kingkong-mag.com\/#organization","name":"kingkong","alternateName":"kikk asbl","url":"https:\/\/kingkong-mag.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/kingkong-mag.com\/#\/schema\/logo\/image\/","url":"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2022\/11\/kikk_asbl-copie.png","contentUrl":"https:\/\/kingkong-mag.com\/wp-content\/uploads\/2022\/11\/kikk_asbl-copie.png","width":916,"height":367,"caption":"kingkong"},"image":{"@id":"https:\/\/kingkong-mag.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/kingkong.be","https:\/\/twitter.com\/kingkong_kikk","https:\/\/www.tiktok.com\/@kingkong.kikk","https:\/\/www.instagram.com\/kingkong.kikk\/"]},{"@type":"Person","@id":"https:\/\/kingkong-mag.com\/#\/schema\/person\/1dcf567c6938f5e019824f817b17b3bc","name":"Adrien Cornelissen","description":"Au fil de ses exp\u00e9riences, Adrien Cornelissen a d\u00e9velopp\u00e9 une expertise sur les probl\u00e9matiques li\u00e9es \u00e0 l'innovation et la cr\u00e9ation num\u00e9rique. Il a collabor\u00e9 avec une dizaine de magazines fran\u00e7ais dont Fisheye Immersive, XRMust, Usbek &amp; Rica, Nectart ou la Revue AS. Il coordonne HACNUMedia qui explore les mutations engendr\u00e9es par les technologies dans la cr\u00e9ation contemporaine. Adrien Cornelissen intervient dans des \u00e9tablissements d\u2019enseignement sup\u00e9rieur et des structures de la cr\u00e9ation.","sameAs":["https:\/\/www.linkedin.com\/in\/adrien-cornelissen-435810135\/"],"url":"https:\/\/kingkong-mag.com\/en\/author\/adrien-cornelissen\/"}]}},"_links":{"self":[{"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/posts\/17750","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/users\/25"}],"replies":[{"embeddable":true,"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/comments?post=17750"}],"version-history":[{"count":1,"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/posts\/17750\/revisions"}],"predecessor-version":[{"id":17764,"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/posts\/17750\/revisions\/17764"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/media\/17742"}],"wp:attachment":[{"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/media?parent=17750"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/categories?post=17750"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/tags?post=17750"},{"taxonomy":"type_article","embeddable":true,"href":"https:\/\/kingkong-mag.com\/en\/wp-json\/wp\/v2\/type_article?post=17750"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}