Llama 3.1 Megathread
Meta has released and open-sourced Llama 3.1 in three sizes: 8B, 70B, and 405B.
This iteration brings state-of-the-art performance to the open-source ecosystem.
If you've had a chance to use Llama 3.1 in any of its variants - let us know how you like it and what you're using it for in the comments below!
For this release, we evaluated performance on over 150 benchmark datasets that span a wide range of languages. In addition, we performed extensive human evaluations that compare Llama 3.1 with competing models in real-world scenarios. Our experimental evaluation suggests that our flagship model is competitive with leading foundation models, including GPT-4, GPT-4o, and Claude 3.5 Sonnet, across a range of tasks. Additionally, our smaller models are competitive with closed and open models that have a similar number of parameters.

Image of the Week
HyperTech News Report #0002 - A New Challenger Approaches!
cross-posted from: https://lemmy.world/post/5965315
🤖 Happy FOSAI Friday! 🚀
Friday, September 29, 2023
HyperTech News Report #0002
Hello Everyone!
Welcome back to the HyperTech News Report! This week we're seeing some really exciting developments in futuristic technologies. With more tools and methods being released by the day, I feel we're in for a renaissance in software. I hope hardware is soon to follow... but I am here for it! So are you. Brace yourselves. Change is coming! This next year will be very interesting to watch unfold.
Table of Contents
Community Changelog
- Cleaned up some old content (let me know if you notice something that should be archived or updated)
Image of the Week
HyperTech News Report #0001 - Happy FOSAI Friday!
cross-posted from: https://lemmy.world/post/5549499
🤖 Happy FOSAI Friday! 🚀
Friday, September 22, 2023
HyperTech News Report #0001
Hello Everyone!
This series is a new vehicle for !fosai@lemmy.world news reports. In these posts I'll go over projects or news I stumble across week-over-week. I will try to keep this series consistent on Fridays, covering much of what I already have been (but at a regular cadence). For this week, I am going to do my best catching us up on a few old (and new) hot topics you may or may not have heard about already.
Table of Contents
Community Changelog
- Updated all resources on FOSAI ▲ XYZ.
- Added new content to FOSAI ▲ XYZ.
- Added new content and re
CodeLlama-34B - the First Open-Source Model Beating GPT-4 on HumanEvals
cross-posted from: https://lemmy.world/post/3879861
Beating GPT-4 on HumanEval with a Fine-Tuned CodeLlama-34B
Hello everyone! This post marks an exciting moment for !fosai@lemmy.world and everyone in the open-source large language model and AI community.
We appear to have a new contender on the block: a model apparently capable of surpassing OpenAI's state-of-the-art GPT-4 in coding evals (evaluations).
This is huge. Not too long ago I made an offhand comment on us catching up to GPT-4 within a year. I did not expect that prediction to end up being reality in half the time. Let's hope this isn't a one-off scenario and that we see a new wave of open-source models that begin to challenge OpenAI.
Buckle up, it's going to get interesting!
Here are some notes from the blog, which you should visit and read in its entirety:
Blog Post
Introducing Stable-Diffusion.cpp (Inference in Pure C/C++)
cross-posted from: https://lemmy.world/post/3549390
stable-diffusion.cpp
Introducing stable-diffusion.cpp, a pure C/C++ inference engine for Stable Diffusion! This is a really awesome implementation to help speed up home inference of diffusion models. Tailored for developers and AI enthusiasts, this repository offers a high-performance solution for creating and manipulating images using various quantization techniques and accelerated inference.
Key Features:
- Efficient Implementation: Written in plain C/C++ in the style of llama.cpp, and built on the ggml framework.
- Multiple Precision Support: Choose between 16-bit and 32-bit floats, or 4-bit to 8-bit integer quantization.
- Optimized Performance: Experience memory-efficient CPU inference with AVX, AVX2, and AVX512 support for x86 architectures.
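To make the precision options above concrete, here is a minimal sketch (not taken from the repo, and much simpler than ggml's actual block-wise schemes) of symmetric 8-bit integer quantization, the general idea behind trading precision for memory:

```python
def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization: map floats into [-127, 127]."""
    scale = max(abs(w) for w in weights) / 127.0
    quantized = [round(w / scale) for w in weights]
    return quantized, scale

def dequantize(quantized, scale):
    """Recover approximate float weights from int8 values."""
    return [q * scale for q in quantized]

weights = [0.5, -1.2, 0.03, 0.9]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# Each weight now fits in one byte instead of four, at a small accuracy cost.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Storing one byte per weight instead of four is what lets these engines fit large models into ordinary desktop RAM.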
Cheetor - A New Multi-Modal LLM Strategy Empowered by Controllable Knowledge Re-Injection
cross-posted from: https://lemmy.world/post/3439370
Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions
A wild new GitHub Repo has appeared!
Today we cover Cheetah - an exciting new take on interleaving image and text context & instruction.
For higher quality images, please visit the main project's repo to see their code and approach in all of their glory.
I4 Benchmark
To facilitate research in interleaved vision-language instruction following, we build I4 (semantically Interconnected, Interleaved Image-Text Instruction-Following), an extensive large-scale benchmark of 31 tasks with diverse instructions in a unified instruction-response format, covering 20 diverse scenarios.
I4 has three important properties:
- Interleaved vision-language context: all the instructions contain sequenc
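For illustration only (the field names below are assumptions, not the schema from the I4 release), a record in a unified instruction-response format with interleaved image and text segments might look like:

```python
# Hypothetical example of one I4-style record: the instruction is a sequence
# of interleaved text and image segments, paired with a single response.
record = {
    "instruction": [
        {"type": "text", "value": "Compare the two charts:"},
        {"type": "image", "value": "chart_q1.png"},
        {"type": "text", "value": "and"},
        {"type": "image", "value": "chart_q2.png"},
    ],
    "response": "Revenue grew in Q2 while costs stayed flat.",
}

# The interleaving property: text and images appear together in one sequence.
has_image = any(seg["type"] == "image" for seg in record["instruction"])
has_text = any(seg["type"] == "text" for seg in record["instruction"])
```

The point of the unified format is that all 31 tasks can be trained and evaluated through this one structure, regardless of scenario.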
Incognito Pilot: The Next-Gen AI Code Interpreter for Sensitive Data
cross-posted from: https://lemmy.world/post/3350022
Incognito Pilot: The Next-Gen AI Code Interpreter for Sensitive Data
Hello everyone! Today marks the first day of a new series of posts featuring projects in my GitHub Stars.
Most of these repos are FOSS & FOSAI focused, meaning they should be hackable, free, and (mostly) open-source.
We're going to kick this series off by sharing Incognito Pilot. It’s like the ChatGPT Code Interpreter but for those who prioritize data privacy.
Project Summary from ChatGPT-4:
Features:
- Powered by Large Language Models like GPT-4 and Llama 2.
- Run code and execute tasks with a Python interpreter.
- Privacy: Interacts with the cloud, but sensitive data stays local.
- Local or Remote: Choose between local LLMs
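The privacy idea in the feature list, where only redacted text reaches the cloud while real values stay on your machine, can be sketched as follows. This is a simplified illustration of the general placeholder pattern, not Incognito Pilot's actual implementation:

```python
def redact(text, secrets):
    """Swap sensitive values for placeholder tokens before text leaves the machine."""
    mapping = {}
    for i, secret in enumerate(secrets):
        token = f"<<SECRET_{i}>>"
        mapping[token] = secret
        text = text.replace(secret, token)
    return text, mapping

def restore(text, mapping):
    """Swap placeholders back for the real values, locally."""
    for token, secret in mapping.items():
        text = text.replace(token, secret)
    return text

prompt = "Summarize the account for alice@example.com (ID 4417)."
safe_prompt, mapping = redact(prompt, ["alice@example.com", "4417"])
# safe_prompt can be sent to a cloud LLM; `mapping` never leaves the machine.
round_trip = restore(safe_prompt, mapping)
```

The cloud model only ever sees the placeholder tokens, so it can still reason about the task without being handed the sensitive values themselves.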
I used to feel the same way until I found some very interesting performance results from 3B and 7B parameter models.
Granted, it wasn’t anything I’d deploy to production - but using the smaller models to prototype quick ideas is great before having to rent a GPU and spend time working with the bigger models.
Give a few models a try! You might be pleasantly surprised. There’s plenty to choose from too. You will get wildly different results depending on your use case and prompting approach.
Let us know if you end up finding one you like! I think it is only a matter of time before we’re running 40B+ parameters at home (casually).
Vicuna v1.5 Has Been Released!
Click Here to be Taken to the Megathread!
from !fosai@lemmy.world
Vicuna v1.5 Has Been Released!
Shoutout to GissaMittJobb@lemmy.ml for catching this in an earlier post.
Given Vicuna was a widely appreciated member of the original Llama series, it'll be exciting to see this model evolve and adapt with fresh datasets and new training and fine-tuning approaches.
Feel free to use this megathread to chat about Vicuna and any of your experiences with Vicuna v1.5!
Starting off with Vicuna v1.5
TheBloke is already sharing models!
Vicuna v1.5 GPTQ
7B
13B
**Vi
I am actively testing this out. It's hard to say at the moment. There's a lot to figure out deploying a model into a live environment, but I think there's real value in using them for technical tasks - especially as models mature and improve over time.
At the moment, though, performance is closer to GPT-3.5 than GPT-4, but I wouldn't be surprised if this is no longer the case within the next year or so.
After finally having a chance to test some of the new Llama-2 models, I think you're right. There's still some work to be done to get them tuned up... I'm going to dust off some of my notes and get a new index of those other popular gen-1 models out there later this week.
I'm very curious to try out some of these docker images, too. Thanks for sharing those! I'll check them when I can. I could also make a post about them if you feel like featuring some of your work. Just let me know!
Assuming everything from the papers translates into current platforms, yes! A rather significant one at that. Time will tell us the true results as people begin tinkering with this new approach in the near future.
Thanks for reading! I'm glad you enjoy the content. I find this tech beyond fascinating.
Who knows, over time you might even begin to pick up on some of the nuances you describe.
We're all learning this together!
Thanks for sharing this!
Good bot, I will do that next time.
Come hangout with us at !fosai@lemmy.world
I run this show solo at the moment, but do my best to keep everyone informed. I have much more content on the horizon. Would love to have you if we have what you're looking for.
FOSAI Posts:
OpenAI has launched a new initiative, Superalignment, aimed at guiding and controlling ultra-intelligent AI systems. Recognizing the imminent arrival of AI that surpasses human intellect, the project will dedicate significant resources to ensure these advanced systems act in accordance with human intent. It's a crucial step in managing the transformative and potentially dangerous impact of superintelligent AI.
I like to think this starts to explore interesting philosophical questions like human intent, consciousness, and the projection of will into systems that are far beyond our capabilities in raw processing power and input/output. What may happen from this intended alignment is yet to be seen, but I think we can all agree the last thing we want in these emerging intelligent machines is to do things we don't want them to do.
'Superalignment' is OpenAI's answer to how to put up these safeguards. Whether or not it is the best method remains to be determined.