
We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Please review our community rules and introduce yourself!
Stock analysis data pipeline Episode 1
Click to view this content.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Decoding the Success of Gzip + KNN: The Central Role of LZ77
cross-posted from: https://lemmy.world/post/2706141
Decoding the Success of Gzip + KNN: The Central Role of LZ77
CUDA Templates for Linear Algebra Subroutines. Contribute to NVIDIA/cutlass development by creating an account on GitHub.
New trick scales LLMs even longer! - GitHub - jquesnelle/scaled-rope
News I've seen the posts about SuperHOT and just recently, the paper from Meta which uses RoPE interpolation, and I've noticed an immediate improvement that can be brought to this method. Basically if you apply Neural Tangent Kernel (NTK) theory to this problem, it becomes clear that simply interpolating the RoPE's fourier space "linearly" is very sub-optimal, as it prevents the network to distinguish the order and positions of tokens that are v
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer - GitHub - teknium1/GPTeacher: A collection of modular datasets generated ...
Fine tune and deploy generative models via a single API
Google Research. Contribute to google-research/google-research development by creating an account on GitHub.
All datasets in this repository are released under the CC BY 4.0 International license, which can be found here: https://creativecommons.org/licenses/by/4.0/legalcode. All source files in this repository are released under the Apache 2.0 license, the text of which can be found in the LICENSE file.
Relationship between LLM model size and emergent power
注1:本文整理自我在今年3 月 11 日 “中国人工智能学会”主办的「ChatGPT 及大模型专题研讨会」上《大型语言模型的涌现能力:现象与解释》的现场分享,介绍了大语言模型中的涌现现象,以及关于涌现能力背后原因的…
Tensor library for machine learning. Contribute to ggerganov/ggml development by creating an account on GitHub.
General technology for enabling AI capabilities w/ LLMs and MLLMs - GitHub - microsoft/LMOps: General technology for enabling AI capabilities w/ LLMs and MLLMs
GitHub - dadukhankevin/Finch: A Keras style GA genetic algorithm library
Finch is a genetic algorithm framework for python. It is also GPU compatible, making it probably the fastest genetic algorithm framework ever. Inspired by Keras. - GitHub - dadukhankevin/Finch: Fin...
Faster alternative to Metal Performance Shaders. Contribute to philipturner/metal-flash-attention development by creating an account on GitHub.