General Intelligence logo
General Intelligence
Subscribe
  • General Intelligence
  • Topics
  • vision

vision

Computer Vision and Pattern Recognition

hallucinationsdiffusionvisionagentssecurity-compliancetranslationpromptingsafetydeepfakes & fake newsmusicarchitectureRAGalgorithmsfinancevoice and text-to-speechmodelsvideo and text-to-videohealthcaredefense
visionvision
+4+4
Meta introduces LLama Guard 3 Vision
Nov 21, 2024

Meta introduces LLama Guard 3 Vision

Plus AWS' Multi Agent Orchestrator, detecting fake news, and how RL is used in finance

Chris Han
Chris Han
diffusiondiffusion
+5+5
[#30] Meta releases better technique for more efficient video-language understanding
Oct 24, 2024

[#30] Meta releases better technique for more efficient video-language understanding

Plus AMD reduces the cost of training diffusion models, lung cancer detection, autonomous driving from aerial photos, and seamless AI voice conversation

Chris Han
Chris Han
visionvision
+2+2
[#29] AI can identify food from around the world
Oct 17, 2024

[#29] AI can identify food from around the world

Plus Amazon improves fashion recommendations, detecting brain tumors from MRIs, robots imitating humans, Zillow builds a real-estate chatbot

Chris Han
Chris Han
hallucinationshallucinations
+5+5
[#27] A new kind of neural net based on Fourier Transforms
Oct 07, 2024

[#27] A new kind of neural net based on Fourier Transforms

Plus Meta's new research, new techniques to mitigate hallucinations, and generating images from brainwaves

Chris Han
Chris Han
visionvision
+3+3
[#26] This method generates 3D renderings of clothes given a single image of fabric
Oct 03, 2024

[#26] This method generates 3D renderings of clothes given a single image of fabric

Plus AI trading agents, musical AI, and better code generation

Chris Han
Chris Han
hallucinationshallucinations
+3+3
[#24] Meta releases Llama 3.2, models for mobile & edge devices
Sep 26, 2024

[#24] Meta releases Llama 3.2, models for mobile & edge devices

Plus detecting cheating during exams with AI, AI2 releases open-source language models, mitigating hallucinations in vision language models, and zero-shot detection of AI images

Chris Han
Chris Han
visionvision
+4+4
[#23] Apple's HyperCloning: Training Using Small Models
Sep 23, 2024

[#23] Apple's HyperCloning: Training Using Small Models

Plus Amazon innovates on product translations, Anthropic releases Contextual Retrieval, Detecting deepfakes with smartphones, and RLHF can mislead humans

Chris Han
Chris Han
visionvision
+2+2
[#22] NVLM: Nvidia's new family of multimodal LLM
Sep 19, 2024

[#22] NVLM: Nvidia's new family of multimodal LLM

Plus new models for human-like conversations and better RAG, chain-of-thought is only useful in math tasks, and a new health AI newsletter

Chris Han
Chris Han
The latest developments in AI research

General Intelligence

The latest developments in AI research

Home

Posts

© 2025 General Intelligence.

Privacy policy

Terms of use

Powered by beehiiv