{"id":86612,"date":"2023-10-25T16:20:15","date_gmt":"2023-10-25T08:20:15","guid":{"rendered":"https:\/\/www.seeedstudio.com\/blog\/?p=86612"},"modified":"2023-10-27T14:29:57","modified_gmt":"2023-10-27T06:29:57","slug":"what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin","status":"publish","type":"post","link":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/","title":{"rendered":"What is MiniGPT-4? A Deep Dive and Deploy on Jetson Orin"},"content":{"rendered":"\n<p>For complex computer vision tasks, we usually require machines not only to interpret complex visual data but also to comprehend the contextual intricacies through language. That&#8217;s the strength of vision-language models with multimodal capabilities: they improve the accuracy and depth of object detection and open up more intuitive human-machine interaction. MiniGPT-4 is an interesting entry point into the world of multimodal LLMs. <\/p>\n\n\n\n<p>Here we are going to talk about how the technology behind MiniGPT-4 is built, where it can be deployed, and how to get the most out of it on NVIDIA Jetson Orin. 
To explore more of what generative AI can do, feel free to try the <a href=\"http:\/\/www.jetson-ai-lab.com\/tutorial_minigpt4.html\">Jetson Generative AI Lab tutorial<\/a> yourself!<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img fetchpriority=\"high\" decoding=\"async\" src=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/DreamShaper_v7_robot_is_explaining_the_meaning_and_story_behin_0-1030x687.jpg\" alt=\"\" class=\"wp-image-86816\" width=\"773\" height=\"515\" srcset=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/DreamShaper_v7_robot_is_explaining_the_meaning_and_story_behin_0-1030x687.jpg 1030w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/DreamShaper_v7_robot_is_explaining_the_meaning_and_story_behin_0-300x200.jpg 300w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/DreamShaper_v7_robot_is_explaining_the_meaning_and_story_behin_0-768x512.jpg 768w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/DreamShaper_v7_robot_is_explaining_the_meaning_and_story_behin_0-1024x683.jpg 1024w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/DreamShaper_v7_robot_is_explaining_the_meaning_and_story_behin_0-675x450.jpg 675w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/DreamShaper_v7_robot_is_explaining_the_meaning_and_story_behin_0.jpg 1152w\" sizes=\"(max-width: 773px) 100vw, 773px\" \/><figcaption class=\"wp-element-caption\">Having a machine explain the art in a museum with one click could become possible<\/figcaption><\/figure><\/div>\n\n\n<h2 class=\"wp-block-heading\">What is MiniGPT-4?<\/h2>\n\n\n\n<p><a href=\"https:\/\/arxiv.org\/pdf\/2304.10592.pdf\">MiniGPT-4<\/a> is an open-source vision-language model that aims to reproduce the multimodal abilities demonstrated by GPT-4. 
It was developed to test whether a more advanced large language model can strengthen multimodal generation (we&#8217;ll talk about multimodal deep learning in the following parts). <\/p>\n\n\n\n<p>MiniGPT-4 aligns a frozen visual encoder, made up of a pre-trained ViT and Q-Former, with a frozen LLM, Vicuna, using only one projection layer. With this setup it demonstrates many advanced multimodal capabilities similar to GPT-4, such as generating detailed image descriptions and creating websites from hand-drawn sketches, and it can even write poems or give guidance based on a given image.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img decoding=\"async\" src=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/WeChatWorkScreenshot_9364a09f-c73b-4e1f-94b1-16f75f61012e-1-1030x705.png\" alt=\"\" class=\"wp-image-86813\" width=\"773\" height=\"529\" srcset=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/WeChatWorkScreenshot_9364a09f-c73b-4e1f-94b1-16f75f61012e-1-1030x705.png 1030w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/WeChatWorkScreenshot_9364a09f-c73b-4e1f-94b1-16f75f61012e-1-300x205.png 300w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/WeChatWorkScreenshot_9364a09f-c73b-4e1f-94b1-16f75f61012e-1-768x526.png 768w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/WeChatWorkScreenshot_9364a09f-c73b-4e1f-94b1-16f75f61012e-1-1536x1051.png 1536w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/WeChatWorkScreenshot_9364a09f-c73b-4e1f-94b1-16f75f61012e-1-1024x701.png 1024w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/WeChatWorkScreenshot_9364a09f-c73b-4e1f-94b1-16f75f61012e-1.png 1666w\" sizes=\"(max-width: 773px) 100vw, 773px\" \/><figcaption class=\"wp-element-caption\">Image source: 
https:\/\/minigpt-4.github.io\/<\/figcaption><\/figure><\/div>\n\n\n<p>To make the model&#8217;s language output more natural, it&#8217;s important to reduce noise in the training data: the model is fine-tuned on a dataset of detailed image descriptions rather than on short image captions alone. This improves both the reliability of the generated text and the model&#8217;s usability.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">The Two-Stage MiniGPT-4 Method<\/h3>\n\n\n\n<p><strong>1. Pre-training &#8211; align vision and language on a large collection of image-text pairs<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Pre-training runs for 20,000 steps and takes about 10 hours, using around 5 million image-text pairs with a batch size of 256.<\/li>\n\n\n\n<li>After this initial stage the model holds rich knowledge and responds reasonably to human queries. However, its outputs are not guaranteed to match human intentions accurately.<\/li>\n<\/ul>\n\n\n\n<p><strong>2. Vision-language alignment &#8211; fix description errors in data post-processing, then fine-tune the model<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Use ChatGPT to remove repeated or unnecessary sentences from the descriptions generated for 5,000 randomly chosen images.<\/li>\n\n\n\n<li>Verify the correctness of each image description manually. 
About 3,500 of the images pass this check and provide high-quality image-text pairs for the fine-tuning stage.<\/li>\n\n\n\n<li>Like the BLIP-2 vision-language model, MiniGPT-4 can identify objects in images; beyond that, it also shows an ability to retrieve related information about what it sees.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">Deploying MiniGPT-4 on Jetson is Easy and Smooth!<\/h3>\n\n\n\n<p>To build your own secure local inference server that does not depend on a network connection, deploying MiniGPT-4 on NVIDIA Jetson AGX Orin is a good choice. If you have already explored the Jetson Generative AI Lab, you may already know the basic setup workflow for MiniGPT-4. You can run it on Jetson by following these steps:<\/p>\n\n\n\n<p>1. Get a <a href=\"https:\/\/www.seeedstudio.com\/NVIDIArJetson-AGX-Orintm-64GB-Developer-Kit-p-5641.html?queryID=2b7f7917b2db5e6b86ca7a7abd56d0c0&amp;objectID=5641&amp;indexName=bazaar_retailer_products\">Jetson AGX Orin<\/a> edge device and flash the system by following this <a href=\"https:\/\/wiki.seeedstudio.com\/NVIDIA_Jetson\/\">wiki<\/a>.<\/p>\n\n\n\n<p>2. 
Run the following commands in a terminal to install the required packages and start MiniGPT-4.<\/p>\n\n\n\n<div class=\"wp-block-blockspare-blockspare-container alignfull blockspare-b1d85e57-937d-4\" blockspare-animation=\"\"><style>.blockspare-b1d85e57-937d-4 > .blockspare-block-container-wrapper{background-color:#f9f9f9;padding-top:20px;padding-right:20px;padding-bottom:20px;padding-left:20px;margin-top:30px;margin-right:0px;margin-bottom:30px;margin-left:0px;border-radius:0}.blockspare-b1d85e57-937d-4 .blockspare-image-wrap{background-image:none}<\/style><div class=\"blockspare-block-container-wrapper blockspare-hover-item\"><div class=\"blockspare-container-background blockspare-image-wrap has-background-opacity-100 has-background-opacity\"><\/div><div class=\"blockspare-container\"><div class=\"blockspare-inner-blocks blockspare-inner-wrapper-blocks\">\n<pre class=\"wp-block-code\"><code>git clone https:\/\/github.com\/dusty-nv\/jetson-containers\ncd jetson-containers\nsudo apt update; sudo apt install -y python3-pip\npip3 install -r requirements.txt\n\n.\/run.sh $(.\/autotag minigpt4) \/bin\/bash -c 'cd \/opt\/minigpt4.cpp\/minigpt4 &amp;&amp; python3 webui.py \\\n  $(huggingface-downloader --type=dataset maknee\/minigpt4-13b-ggml\/minigpt4-13B-f16.bin) \\\n  $(huggingface-downloader --type=dataset maknee\/ggml-vicuna-v0-quantized\/ggml-vicuna-13B-v0-q5_k.bin)'<\/code><\/pre>\n<\/div><\/div><\/div><\/div>\n\n\n\n<p>3. Open a browser on a device on the same network and go to http:\/\/&lt;Jetson_Device_IP&gt;:7860<\/p>\n\n\n\n<p>Enjoy your personal AI chatbot! 
<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/files.seeedstudio.com\/wiki\/NVIDIA\/minigpt4_2.gif\" alt=\"\"\/><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Seeed:&nbsp;NVIDIA&nbsp;Jetson Ecosystem Partner<\/h3>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter is-resized\"><img decoding=\"async\" src=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/08\/nvidia-elite-partner-badge-rgb-for-screen.jpg\" alt=\"\" class=\"wp-image-82789\" width=\"233\" height=\"109\" srcset=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/08\/nvidia-elite-partner-badge-rgb-for-screen.jpg 465w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/08\/nvidia-elite-partner-badge-rgb-for-screen-300x140.jpg 300w\" sizes=\"(max-width: 233px) 100vw, 233px\" \/><\/figure><\/div>\n\n\n<p>Seeed is an Elite partner for edge AI in the&nbsp;<a href=\"https:\/\/www.nvidia.com\/en-us\/about-nvidia\/partners\/\"><u>NVIDIA Partner Network<\/u><\/a>. 
Explore more carrier boards, full system devices, customization services, use cases, and developer tools on&nbsp;<a href=\"https:\/\/www.seeedstudio.com\/nvidia-jetson.html\"><u>Seeed\u2019s&nbsp;NVIDIA Jetson ecosystem<\/u><\/a>&nbsp;page.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/09\/WeChatWorkScreenshot_0fa3f479-5b1c-4e23-9896-b989d6bde69c-3-1030x599.png\" alt=\"\" class=\"wp-image-84528\" width=\"773\" height=\"449\" srcset=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/09\/WeChatWorkScreenshot_0fa3f479-5b1c-4e23-9896-b989d6bde69c-3-1030x599.png 1030w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/09\/WeChatWorkScreenshot_0fa3f479-5b1c-4e23-9896-b989d6bde69c-3-300x174.png 300w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/09\/WeChatWorkScreenshot_0fa3f479-5b1c-4e23-9896-b989d6bde69c-3-768x446.png 768w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/09\/WeChatWorkScreenshot_0fa3f479-5b1c-4e23-9896-b989d6bde69c-3-1536x893.png 1536w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/09\/WeChatWorkScreenshot_0fa3f479-5b1c-4e23-9896-b989d6bde69c-3-1024x595.png 1024w, https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/09\/WeChatWorkScreenshot_0fa3f479-5b1c-4e23-9896-b989d6bde69c-3.png 1958w\" sizes=\"(max-width: 773px) 100vw, 773px\" \/><\/figure><\/div>\n\n\n<p>Join the forefront of AI innovation with us! Harness the power of cutting-edge hardware and technology to revolutionize the deployment of machine learning in the real world across industries. Be a part of our mission to provide developers and enterprises with the best ML solutions available. 
Check out our successful&nbsp;<a href=\"https:\/\/files.seeedstudio.com\/wiki\/NVIDIA\/NVIDIA_Jetson_example-Success_cases_and_examples_with_NVIDIA_Jetson.pdf\">case study catalog<\/a>&nbsp;to discover more edge AI possibilities!<\/p>\n\n\n\n<p>Take the first step and send us an email at&nbsp;<a href=\"mailto:edgeai@seeed.cc\">edgeai@seeed.cc<\/a>&nbsp;to become a part of this exciting journey!&nbsp;<\/p>\n\n\n\n<p>Download our latest&nbsp;<a href=\"https:\/\/files.seeedstudio.com\/wiki\/Seeed_Jetson\/Seeed-NVIDIA_Jetson_Catalog_V1.4.pdf\">Jetson Catalog<\/a>&nbsp;to find one option that suits you well. If you can\u2019t find the off-the-shelf Jetson hardware solution for your needs, please check out our&nbsp;<a href=\"https:\/\/www.seeedstudio.com\/odm\">customization services<\/a>, and submit a new product inquiry to us at&nbsp;odm@seeed.cc&nbsp;for evaluation.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>For complex computer vision tasks, we usually require machines not only to interpret complex 
visual<\/p>\n","protected":false},"author":3606,"featured_media":86827,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_lmt_disableupdate":"","_lmt_disable":"","_price":"","_stock":"","_tribe_ticket_header":"","_tribe_default_ticket_provider":"","_tribe_ticket_capacity":"0","_ticket_start_date":"","_ticket_end_date":"","_tribe_ticket_show_description":"","_tribe_ticket_show_not_going":false,"_tribe_ticket_use_global_stock":"","_tribe_ticket_global_stock_level":"","_global_stock_mode":"","_global_stock_cap":"","_tribe_rsvp_for_event":"","_tribe_ticket_going_count":"","_tribe_ticket_not_going_count":"","_tribe_tickets_list":"[]","_tribe_ticket_has_attendee_info_fields":false,"iawp_total_views":0,"footnotes":""},"categories":[1,4393],"tags":[1358,4728,4440,4726,4729,4727,1312,4725],"class_list":["post-86612","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","category-tech","tag-deep-learning","tag-gpt4","tag-jetson-orin","tag-llm","tag-minigpt4","tag-multimodal","tag-nvidia","tag-vision-language-model"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.0 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>What is MiniGPT-4? A Deep Dive and Deploy on Jetson Orin - Latest News from Seeed Studio<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What is MiniGPT-4? 
A Deep Dive and Deploy on Jetson Orin - Latest News from Seeed Studio\" \/>\n<meta property=\"og:description\" content=\"For complex computer vision tasks, we usually require machines not only to interpret complex visual\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/\" \/>\n<meta property=\"og:site_name\" content=\"Latest News from Seeed Studio\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-25T08:20:15+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-10-27T06:29:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"628\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Jennie Wang\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Jennie Wang\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/\",\"url\":\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/\",\"name\":\"What is MiniGPT-4? 
A Deep Dive and Deploy on Jetson Orin - Latest News from Seeed Studio\",\"isPartOf\":{\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png\",\"datePublished\":\"2023-10-25T08:20:15+00:00\",\"dateModified\":\"2023-10-27T06:29:57+00:00\",\"author\":{\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/#\/schema\/person\/21041ae3908bbb4d44533f2b3b115fd1\"},\"breadcrumb\":{\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#primaryimage\",\"url\":\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png\",\"contentUrl\":\"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png\",\"width\":1200,\"height\":628},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.seeedstudio.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"What is MiniGPT-4? 
A Deep Dive and Deploy on Jetson Orin\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/#website\",\"url\":\"https:\/\/www.seeedstudio.com\/blog\/\",\"name\":\"Latest News from Seeed Studio\",\"description\":\"Emerging IoT, AI and Autonomous Applications on the Edge\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.seeedstudio.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/#\/schema\/person\/21041ae3908bbb4d44533f2b3b115fd1\",\"name\":\"Jennie Wang\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.seeedstudio.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/b8fdf0c9ad5c32ab4f3981bb35a10566?s=96&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/b8fdf0c9ad5c32ab4f3981bb35a10566?s=96&r=g\",\"caption\":\"Jennie Wang\"},\"description\":\"Seeed Studio AIoT Marketing and Partnership Always coffee always alive \u2615\ufe0f\",\"sameAs\":[\"www.linkedin.com\/in\/jialinwang1215\"],\"url\":\"https:\/\/www.seeedstudio.com\/blog\/author\/jennie-wang\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"What is MiniGPT-4? A Deep Dive and Deploy on Jetson Orin - Latest News from Seeed Studio","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/","og_locale":"en_US","og_type":"article","og_title":"What is MiniGPT-4? 
A Deep Dive and Deploy on Jetson Orin - Latest News from Seeed Studio","og_description":"For complex computer vision tasks, we usually require machines not only to interpret complex visual","og_url":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/","og_site_name":"Latest News from Seeed Studio","article_published_time":"2023-10-25T08:20:15+00:00","article_modified_time":"2023-10-27T06:29:57+00:00","og_image":[{"width":1200,"height":628,"url":"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png","type":"image\/png"}],"author":"Jennie Wang","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Jennie Wang","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/","url":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/","name":"What is MiniGPT-4? 
A Deep Dive and Deploy on Jetson Orin - Latest News from Seeed Studio","isPartOf":{"@id":"https:\/\/www.seeedstudio.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#primaryimage"},"image":{"@id":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#primaryimage"},"thumbnailUrl":"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png","datePublished":"2023-10-25T08:20:15+00:00","dateModified":"2023-10-27T06:29:57+00:00","author":{"@id":"https:\/\/www.seeedstudio.com\/blog\/#\/schema\/person\/21041ae3908bbb4d44533f2b3b115fd1"},"breadcrumb":{"@id":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#primaryimage","url":"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png","contentUrl":"https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png","width":1200,"height":628},{"@type":"BreadcrumbList","@id":"https:\/\/www.seeedstudio.com\/blog\/2023\/10\/25\/what-is-minigpt-4-a-deep-dive-in-multimodal-llm-and-deploy-on-jetson-orin\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.seeedstudio.com\/blog\/"},{"@type":"ListItem","position":2,"name":"What is MiniGPT-4? 
A Deep Dive and Deploy on Jetson Orin"}]},{"@type":"WebSite","@id":"https:\/\/www.seeedstudio.com\/blog\/#website","url":"https:\/\/www.seeedstudio.com\/blog\/","name":"Latest News from Seeed Studio","description":"Emerging IoT, AI and Autonomous Applications on the Edge","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.seeedstudio.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.seeedstudio.com\/blog\/#\/schema\/person\/21041ae3908bbb4d44533f2b3b115fd1","name":"Jennie Wang","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.seeedstudio.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/b8fdf0c9ad5c32ab4f3981bb35a10566?s=96&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/b8fdf0c9ad5c32ab4f3981bb35a10566?s=96&r=g","caption":"Jennie Wang"},"description":"Seeed Studio AIoT Marketing and Partnership Always coffee always alive \u2615\ufe0f","sameAs":["www.linkedin.com\/in\/jialinwang1215"],"url":"https:\/\/www.seeedstudio.com\/blog\/author\/jennie-wang\/"}]}},"modified_by":"Jennie 
Wang","views":5910,"featured_image_urls":{"full":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png",1200,628,false],"thumbnail":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog-80x80.png",80,80,true],"medium":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog-300x157.png",300,157,true],"medium_large":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog-768x402.png",640,335,true],"large":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog-1030x539.png",640,335,true],"1536x1536":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png",1200,628,false],"2048x2048":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png",1200,628,false],"visody_icon":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png",32,17,false],"magazine-7-slider-full":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog.png",1200,628,false],"magazine-7-slider-center":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog-936x628.png",936,628,true],"magazine-7-featured":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog-1024x536.png",1024,536,true],"magazine-7-medium":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog-720x380.png",720,380,true],"magazine-7-medium-square":["https:\/\/www.seeedstudio.com\/blog\/wp-content\/uploads\/2023\/10\/blog-675x450.png",675,450,true]},"author_info":{"display_name":"Jennie Wang","author_link":"https:\/\/www.seeedstudio.com\/blog\/author\/jennie-wang\/"},"category_info":"<a href=\"https:\/\/www.seeedstudio.com\/blog\/category\/news\/\" rel=\"category tag\">News<\/a> <a href=\"https:\/\/www.seeedstudio.com\/blog\/category\/tech\/\" rel=\"category 
tag\">Tech<\/a>","tag_info":"Tech","comment_count":"0","_links":{"self":[{"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/posts\/86612","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/users\/3606"}],"replies":[{"embeddable":true,"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/comments?post=86612"}],"version-history":[{"count":25,"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/posts\/86612\/revisions"}],"predecessor-version":[{"id":91593,"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/posts\/86612\/revisions\/91593"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/media\/86827"}],"wp:attachment":[{"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/media?parent=86612"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/categories?post=86612"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.seeedstudio.com\/blog\/wp-json\/wp\/v2\/tags?post=86612"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}