Machine Learning Awesome

GPT Awesome
Stable Diffusion Awesome
OCR Awesome
GokuMohandas/MadeWithML
josephmisiti/awesome-machine-learning
wunderwuzzi23/mlattacks
tangramdotdev/tangram
- HN
facebookresearch/moco-v3
facebookresearch/ParlAI
- BlenderBot 2.0: An open source chatbot that builds long-term memory and searches the internet
- InfoQ BlenderBot 2.0 Chatbot
JianshuZhang/WAP
salesforce/warp-drive
isl-org/MiDaS
NVlabs/stylegan3
PaddlePaddle/Knover
google-research-datasets/timedial
- https://arxiv.org/abs/2106.04571
google-research-datasets/disfl-qa
- https://arxiv.org/abs/2106.04016
vijaydwivedi75/gnn-lspe
microsoft/onnxruntime
PyTorch 1.10 HN
facebookresearch/salina
facebookresearch/ppuda
wuhaozhe/style_avatar
facebook/prophet
- producing high quality forecasts for time series data
facebookresearch/Kats
- analyze time series data
deepinsight/insightface
- 2D and 3D Face Analysis
VowpalWabbit/vowpal_wabbit
- frontier of machine learning
wav2vec
https://nv-tlabs.github.io/editGAN/
bryandlee/animegan2-pytorch AnimeGANv2 HN
yeemachine/kalidokit
google-research/ibc
- Implicit Behavioral Cloning
bhky/opennsfw2
- Yahoo Open-NSFW model
eugeneyan/applied-ml
PeterL1n/RobustVideoMatting
yuval-alaluf/hyperstyle
SysCV/pcan
tinyfool/VideoRemoveBackground
open-mmlab/mmhuman3d
BaltiApps/Pixelify-Google-Photos
sedthh/pyxelate
facebookresearch/theseus
onion-liu/aahq-dataset
- Artstation-Artistic-face-HQ Dataset (AAHQ)
parrt/tensor-sensor
mattbradley/dash
openai/whisper
- HN
- https://openai.com/blog/whisper/
https://banmo-www.github.io/
https://nn-512.com/
https://hypernerf.github.io/
babysor/MockingBird
- 5 秒内克隆您的声音并生成任意语音内容
- HN
https://mlconsole.com/
AminRezaei0x443/memory-efficient-attention
- Jax, PyTorch
Perceiver IO: a scalable, fully-attentional model that works on any modality
mchong6/JoJoGAN
facebookresearch/SLIP
- Self-supervision meets Language-Image Pre-training
alibaba/DeepRec
- recommendation engine
open-mmlab/mmdeploy
handtracking-io/yoha
facebookresearch/Detic
Machine-Learning-Tokyo/Interactive_Tools
ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
kmario23/deep-learning-drizzle
academic/awesome-datascience
trekhleb/homemade-machine-learning
donnemartin/data-science-ipython-notebooks
https://github.com/facebookresearch/mae
- Masked Autoencoders
https://github.com/naver-ai/c3-gan
kaegi/alass
- Automatic Language-Agnostic Subtitle Synchronization
SubSync: Subtitle Speech Synchronizer
- HN
gnes-ai/gnes
apache/tvm
facebookresearch/ConvNeXt
NVlabs/instant-ngp
lucidrains/RETRO-pytorch
libffcv/ffcv
johnnyzn/DW-GAN
- CAPTCHA
Kubasinska/MI-EEG-1D-CNN
google-research/circuit_training
Justin62628/Squirrel-RIFE
automl/auto-sklearn
open-mmlab/mmrotate
- Rotated Object Detection
google-research/frame-interpolation
- Frame Interpolation for Large Motion
patrick-kidger/diffrax
dynamite-ready/movie-parser
Nixtla/neuralforecast
- forecasting algorithms for time series data
ouhenio/stylegan3-projector
- StyleGAN3 + Inversion
victordibia/handtrack.js
- HN
facebookincubator/gloo
horovod/horovod
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet
facebookresearch/contriever
https://bishopfox.com/blog/unredacter-tool-never-pixelation
- unmask
jokenox/Goopt
- 文本内容生成
fastai/fastcore
GroupViT: Semantic Segmentation Emerges from Text Supervision
TorchStudio/torchstudio
replicate/cog Containers for machine learning
https://www.nvidia.com/en-us/studio/canvas/
https://pimeyes.com/
Parti: Pathways Autoregressive Text-to-Image Model
- HN
kuprel/min-dalle
- DALL E Mini PyTorch
borisdayma/dalle-mini
Keytap3: check if your keyboard can be eavesdropped through a microphone
iperov/DeepFaceLive
TencentARC/GFPGAN
- Real-world Face Restoration
google-research/multinerf
- Mip-NeRF 360, Ref-NeRF, and RawNeRF
Running your own A.I. Image Generator with Latent-Diffusion
- HN
Anjok07/ultimatevocalremovergui
- 移除人声
NVIDIAGameWorks/kaolin-wisp
- PyTorch library powered by NVIDIA Kaolin Core to work with neural fields
- NeRFs, NGLOD, instant-ngp and VQAD
WongKinYiu/yolov7
nnaisense/evotorch
YOLOv7 Breakdown
Musico: AI Generated Music
- HN
Adventure game graphics with DALL-E 2
- HN
VToonify
orgs
- NVlabs
- facebookresearch
- THUDM
  - 清华 KEG & 数据挖掘
music
- AI-Guru/music-generation-research
serving
- johnolafenwa/deepstack
upscale
- Araxeus/PNG-Upscale
  - MIT, Java
- IBM/MAX-Image-Resolution-Enhancer
  - Apache-2.0, Python
  - Docker
- upscayl/upscayl
  - AGPL-3.0
  - 需要 GPU
  - App 方式
  - Real-ESRGAN

Learn

Books

Probabilistic Machine Learning: An Introduction
An Introduction to Applied Bayesian Modeling https://www.bayesrulesbook.com/
- https://xianblog.wordpress.com/2022/07/05/bayes-rules-book-review/

Framework

minitorch/minitorch
microsoft/SynapseML
- MIT, Scala
- Distributed Machine Learning
caffe
flashlight/flashlight
- C++ standalone library for machine learning
- from Facebook AI Research Speech team, creators of Torch and Deep Speech
arrayfire/arrayfire
- BSD-3, C++
- general purpose GPU library
- Binding: Python, Rust, Julia, NIM
- WIP: .NET, Go, Java, Lua, JS, R, Ruby
google/jax
- Apache-2.0, Python, C++
- Autograd and XLA
- 基础计算框架
- google/flax
  - neural network
- deepmind/rlax
  - reinforcement learning
- deepmind/optax
  - gradient processing and optimization
- deepmind/dm-haiku
  - neural network
- deepmind/chex
tensorflow/lingvo
- building sequence models neural networks in Tensorflow
- ASR, MT
dlib
clab/dynet
CNTK
mlpack
SHARK
Armadillo
Faisis
OpenNN
FANN
bennylp/awesome-cpp-ml
Boosting
- XGBoost
- ThunderGBM
- LightGBM
- CatBoost
Web
- BrainJS/brain.js
  - MIT, TS
  - GPU accelerated Neural networks in JavaScript for Browsers and Node.js
- @tensorflow/tfjs
  - https://www.tensorflow.org/js/
  - tfjs-vis
  - @tensorflow/tfjs-node
  - @tensorflow/tfjs-node-gpu - Linux
  - tensorflow/tfjs-models
- spencermountain/compromise
  - MIT, JS
  - modest NLP
- ml5js/ml5-library
  - 基于 TensorFlow.js
  - Blue Oak Model License 1.0.0 modified
- NaturalNode/natural
  - MIT, JS
  - Tokenizer
  - String Distance
  - Stemmer
  - Bayesian & Logistic Regression Classifier
  - Maximum Entropy Classifier
  - Sentiment Analysis
  - WordNet - moos/wordnet-db
  - 无中文支持
  - NaturalNode/node-sylvester
    - vector, matrix, geometry for JS
  - NaturalNode/node-nltools
- retextjs/retext
- linonetwo/segmentit
  - 中文分词
- wagenaartje/neataptic
  - neuro-evolution & backpropagation
  - 不在维护
- cazala/synaptic
  - 不在维护
POS Tagger - part-of-speech tagger
- Eric Brill

PyTorch vs TensorFlow in 2022
- HN

Service

openvinotoolkit/cvat
- Powerful and efficient Computer Vision Annotation Tool (CVAT)

Language

JetBrains/KotlinDL
- Kotlin DSL for ML

Intrested

ZHKKKe/MODNet
- 背景消除
tonybeltramelli/pix2code
- GUI Screenshot -> Code

Models

BERT
Tacotron
Wavenet/Waveglow/WaveRNN
Eesen, Espresso, Kaldi, Wav2letter, NeMo
VGG’16
VGG’19
ResNet50
ResNet101
ResNet152
ResNet50v2
ResNet101v2
ResNet152v2
MobileNet
MobileNetv2
https://modelplace.ai/models
OpenBMB/BMList

GAN

orpatashnik/StyleCLIP

Music

microsoft/muzic

STT

snakers4/silero-models
alphacep/vosk-api
- Offline speech recognition API
- Python, Java, C#, Node
- 支持中文
- alphacep/vosk-asterisk
  - res-speech-vosk - Asterisk 集成
alphacep/vosk-android-demo
- Offline speech recognition for Android with Vosk library
kaldi-asr/kaldi
- Speech Recognition Toolkit
julius-speech/julius
- Open-Source Large Vocabulary Continuous Speech Recognition Engine
daanzu/kaldi-active-grammar
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
espnet/espnet
- End-to-End Speech Processing Toolkit
flashlight/wav2letter
- Facebook AI Research's Automatic Speech Recognition Toolkit
Nvidia/NeMo
- toolkit for conversational AI
- ASR, NLP, TTS
PaddlePaddle/PaddleSpeech
- ASR toolkit
- 百度 Deep Speech: Scaling up end-to-end speech recognition
mozilla/DeepSpeech
- 基于 Tensorflow
arjo129/uSpeech
- Speech recognition toolkit for the arduino
coqui-ai
- TTS
- STT - 没有中文模型
synesthesiam/voice2json
- 命令行工具
- 中文模型基于 pocketsphinx
- HN
NATSpeech/NATSpeech
- Non-Autoregressive Text-to-Speech (NAR-TTS) framework
cmusphinx
- 工作已经开始转移到 Kaldi, Vosk
- cmusphinx/pocketsphinx

术语

abbr	mean	desc
ASR	Automatic Speech Recognition
TTS	Text-to-speech
SE	Speech enhancement/separation
ST	Speech Translation
MT	Machine Translation
VC	Voice conversion

Hardware Platform

RTX
Colab Pro
Paperspace Pro

Machine Learning Awesome

Learn​

Framework​

Service​

Language​

Intrested​

Models​

GAN​

Music​

STT​

Hardware Platform​

Learn

Framework

Service

Language

Intrested

Models

GAN

Music

STT

Hardware Platform