Machine Learning Awesome
- GPT Awesome
- Stable Diffusion Awesome
- OCR Awesome
- GokuMohandas/MadeWithML
- josephmisiti/awesome-machine-learning
- wunderwuzzi23/mlattacks
- tangramdotdev/tangram
- facebookresearch/moco-v3
- facebookresearch/ParlAI
- JianshuZhang/WAP
- salesforce/warp-drive
- isl-org/MiDaS
- NVlabs/stylegan3
- PaddlePaddle/Knover
- google-research-datasets/timedial
- google-research-datasets/disfl-qa
- vijaydwivedi75/gnn-lspe
- microsoft/onnxruntime
- PyTorch 1.10 HN
- facebookresearch/salina
- facebookresearch/ppuda
- wuhaozhe/style_avatar
- facebook/prophet
- producing high quality forecasts for time series data
- facebookresearch/Kats
- analyze time series data
- deepinsight/insightface
- 2D and 3D Face Analysis
- VowpalWabbit/vowpal_wabbit
- frontier of machine learning
- wav2vec
- https://nv-tlabs.github.io/editGAN/
- bryandlee/animegan2-pytorch AnimeGANv2 HN
- yeemachine/kalidokit
- google-research/ibc
- Implicit Behavioral Cloning
- bhky/opennsfw2
- Yahoo Open-NSFW model
- eugeneyan/applied-ml
- PeterL1n/RobustVideoMatting
- yuval-alaluf/hyperstyle
- SysCV/pcan
- tinyfool/VideoRemoveBackground
- open-mmlab/mmhuman3d
- BaltiApps/Pixelify-Google-Photos
- sedthh/pyxelate
- facebookresearch/theseus
- onion-liu/aahq-dataset
- Artstation-Artistic-face-HQ Dataset (AAHQ)
- parrt/tensor-sensor
- mattbradley/dash
- openai/whisper
- https://banmo-www.github.io/
- https://nn-512.com/
- https://hypernerf.github.io/
- babysor/MockingBird
- 5 秒内克隆您的声音并生成任意语音内容
- HN
- https://mlconsole.com/
- AminRezaei0x443/memory-efficient-attention
- Jax, PyTorch
- Perceiver IO: a scalable, fully-attentional model that works on any modality
- mchong6/JoJoGAN
- facebookresearch/SLIP
- Self-supervision meets Language-Image Pre-training
- alibaba/DeepRec
- recommendation engine
- open-mmlab/mmdeploy
- handtracking-io/yoha
- facebookresearch/Detic
- Machine-Learning-Tokyo/Interactive_Tools
- ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
- kmario23/deep-learning-drizzle
- academic/awesome-datascience
- trekhleb/homemade-machine-learning
- donnemartin/data-science-ipython-notebooks
- https://github.com/facebookresearch/mae
- Masked Autoencoders
- https://github.com/naver-ai/c3-gan
- kaegi/alass
- Automatic Language-Agnostic Subtitle Synchronization
- SubSync: Subtitle Speech Synchronizer
- gnes-ai/gnes
- apache/tvm
- facebookresearch/ConvNeXt
- NVlabs/instant-ngp
- lucidrains/RETRO-pytorch
- libffcv/ffcv
- johnnyzn/DW-GAN
- CAPTCHA
- Kubasinska/MI-EEG-1D-CNN
- google-research/circuit_training
- Justin62628/Squirrel-RIFE
- automl/auto-sklearn
- open-mmlab/mmrotate
- Rotated Object Detection
- google-research/frame-interpolation
- Frame Interpolation for Large Motion
- patrick-kidger/diffrax
- dynamite-ready/movie-parser
- Nixtla/neuralforecast
- forecasting algorithms for time series data
- ouhenio/stylegan3-projector
- StyleGAN3 + Inversion
- victordibia/handtrack.js
- facebookincubator/gloo
- horovod/horovod
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet
- facebookresearch/contriever
- https://bishopfox.com/blog/unredacter-tool-never-pixelation
- unmask
- jokenox/Goopt
- 文本内容生成
- fastai/fastcore
- GroupViT: Semantic Segmentation Emerges from Text Supervision
- TorchStudio/torchstudio
- replicate/cog Containers for machine learning
- https://www.nvidia.com/en-us/studio/canvas/
- https://pimeyes.com/
- Parti: Pathways Autoregressive Text-to-Image Model
- kuprel/min-dalle
- DALL E Mini PyTorch
- borisdayma/dalle-mini
- Keytap3: check if your keyboard can be eavesdropped through a microphone
- iperov/DeepFaceLive
- TencentARC/GFPGAN
- Real-world Face Restoration
- google-research/multinerf
- Mip-NeRF 360, Ref-NeRF, and RawNeRF
- Running your own A.I. Image Generator with Latent-Diffusion
- Anjok07/ultimatevocalremovergui
- 移除人声
- NVIDIAGameWorks/kaolin-wisp
- PyTorch library powered by NVIDIA Kaolin Core to work with neural fields
- NeRFs, NGLOD, instant-ngp and VQAD
- WongKinYiu/yolov7
- nnaisense/evotorch
- YOLOv7 Breakdown
- Musico: AI Generated Music
- Adventure game graphics with DALL-E 2
- VToonify
- orgs
- NVlabs
- facebookresearch
- THUDM
- 清华 KEG & 数据挖掘
- music
- serving
- upscale
- Araxeus/PNG-Upscale
- MIT, Java
- IBM/MAX-Image-Resolution-Enhancer
- Apache-2.0, Python
- Docker
- upscayl/upscayl
- AGPL-3.0
- 需要 GPU
- App 方式
- Real-ESRGAN
- Araxeus/PNG-Upscale
Learn
- google-research/google-research
- https://ai.googleblog.com/
- https://www.kdnuggets.com/
- https://stanford.edu/~shervine/teaching/
Books
- Probabilistic Machine Learning: An Introduction
- An Introduction to Applied Bayesian Modeling https://www.bayesrulesbook.com/
- Practical Deep Learning for Coders 2022
- dair-ai/ML-YouTube-Courses
- ML and NLP Research Highlights of 2021
- Machine Learning Algorithms Cheat Sheet
- https://probml.github.io/pml-book/
- https://scikit-learn.org/stable/tutorial/machine_learning_map/index.html
- Neural network from scratch
- Distributional Reinforcement Learning
- A visual introduction to machine learning
- Evaluating Syntactic Abilities of Language Models
- RLDS: An Ecosystem to Generate, Share, and Use Datasets in Reinforcement Learning
- https://learnaifromscratch.github.io/ai.html
- microsoft/ML-For-Beginners
- microsoft/Data-Science-For-Beginners
- LabML Neural Networks
- PyTorch
- Reinforcement Learning: Theory and Algorithms
- How to replace estimations and guesses with a Monte Carlo simulation HN
- Neural Networks from Scratch
- How to train large deep learning models as a startup HN
- Red Hot: The 2021 Machine Learning, AI and Data (MAD) Landscape
- Guide for building an End-to-End Logistic Regression Model
- SELF-PARKING CAR IN 500 LINES OF CODE
- facebookresearch/minihack
- Using Machine Learning to Denoise Images for Better OCR Accuracy
- Introduction to AutoEncoder and Variational AutoEncoder
- SimVLM: Simple Visual Language Model Pre-training with Weak Supervision
- Training a DCGAN in PyTorch
- Self-Supervised Reversibility-Aware Reinforcement Learning
- https://arxiv.org/pdf/2109.02869.pdf
- https://arxiv.org/abs/2106.01345
- http://proceedings.mlr.press/v139/vicol21a.html
- https://arxiv.org/abs/2010.01412
- Machine Learning Research Papers Released In 2021
- Interesting Algorithms Released By Google AI In 2021
- A visual introduction to machine learning
- https://brilliant.org/courses/intro-neural-networks/introduction-65/
Framework
- minitorch/minitorch
- microsoft/SynapseML
- MIT, Scala
- Distributed Machine Learning
- caffe
- flashlight/flashlight
- C++ standalone library for machine learning
- from Facebook AI Research Speech team, creators of Torch and Deep Speech
- arrayfire/arrayfire
- BSD-3, C++
- general purpose GPU library
- Binding: Python, Rust, Julia, NIM
- WIP: .NET, Go, Java, Lua, JS, R, Ruby
- google/jax
- Apache-2.0, Python, C++
- Autograd and XLA
- 基础计算框架
- google/flax
- neural network
- deepmind/rlax
- reinforcement learning
- deepmind/optax
- gradient processing and optimization
- deepmind/dm-haiku
- neural network
- deepmind/chex
- tensorflow/lingvo
- building sequence models neural networks in Tensorflow
- ASR, MT
- dlib
- clab/dynet
- CNTK
- mlpack
- SHARK
- Armadillo
- Faisis
- OpenNN
- FANN
- bennylp/awesome-cpp-ml
- Boosting
- XGBoost
- ThunderGBM
- LightGBM
- CatBoost
- Web
- BrainJS/brain.js
- MIT, TS
- GPU accelerated Neural networks in JavaScript for Browsers and Node.js
- @tensorflow/tfjs
- https://www.tensorflow.org/js/
- tfjs-vis
- @tensorflow/tfjs-node
- @tensorflow/tfjs-node-gpu - Linux
- tensorflow/tfjs-models
- spencermountain/compromise
- MIT, JS
- modest NLP
- ml5js/ml5-library
- 基于 TensorFlow.js
- Blue Oak Model License 1.0.0 modified
- NaturalNode/natural
- MIT, JS
- Tokenizer
- String Distance
- Stemmer
- Bayesian & Logistic Regression Classifier
- Maximum Entropy Classifier
- Sentiment Analysis
- WordNet - moos/wordnet-db
- 无 中文 支持
- NaturalNode/node-sylvester
- vector, matrix, geometry for JS
- NaturalNode/node-nltools
- retextjs/retext
- linonetwo/segmentit
- 中文分词
- wagenaartje/neataptic
- neuro-evolution & backpropagation
- 不在维护
- cazala/synaptic
- 不在维护
- BrainJS/brain.js
- POS Tagger - part-of-speech tagger
- Eric Brill
Service
- openvinotoolkit/cvat
- Powerful and efficient Computer Vision Annotation Tool (CVAT)
Language
- JetBrains/KotlinDL
- Kotlin DSL for ML
Intrested
- ZHKKKe/MODNet
- 背景消除
- tonybeltramelli/pix2code
- GUI Screenshot -> Code
Models
- BERT
- Tacotron
- Wavenet/Waveglow/WaveRNN
- Eesen, Espresso, Kaldi, Wav2letter, NeMo
- VGG’16
- VGG’19
- ResNet50
- ResNet101
- ResNet152
- ResNet50v2
- ResNet101v2
- ResNet152v2
- MobileNet
- MobileNetv2
- https://modelplace.ai/models
- OpenBMB/BMList
GAN
Music
STT
- snakers4/silero-models
- alphacep/vosk-api
- Offline speech recognition API
- Python, Java, C#, Node
- 支持中文
- alphacep/vosk-asterisk
- res-speech-vosk - Asterisk 集成
- alphacep/vosk-android-demo
- Offline speech recognition for Android with Vosk library
- kaldi-asr/kaldi
- Speech Recognition Toolkit
- julius-speech/julius
- Open-Source Large Vocabulary Continuous Speech Recognition Engine
- daanzu/kaldi-active-grammar
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
- espnet/espnet
- End-to-End Speech Processing Toolkit
- flashlight/wav2letter
- Facebook AI Research's Automatic Speech Recognition Toolkit
- Nvidia/NeMo
- toolkit for conversational AI
- ASR, NLP, TTS
- PaddlePaddle/PaddleSpeech
- ASR toolkit
- 百度 Deep Speech: Scaling up end-to-end speech recognition
- mozilla/DeepSpeech
- 基于 Tensorflow
- arjo129/uSpeech
- Speech recognition toolkit for the arduino
- coqui-ai
- TTS
- STT - 没有中文模型
- synesthesiam/voice2json
- 命令行工具
- 中文模型基于 pocketsphinx
- HN
- NATSpeech/NATSpeech
- Non-Autoregressive Text-to-Speech (NAR-TTS) framework
- cmusphinx
- 工作已经开始转移到 Kaldi, Vosk
- cmusphinx/pocketsphinx
术语
abbr | mean | desc |
---|---|---|
ASR | Automatic Speech Recognition | |
TTS | Text-to-speech | |
SE | Speech enhancement/separation | |
ST | Speech Translation | |
MT | Machine Translation | |
VC | Voice conversion |
Hardware Platform
- RTX
- Colab Pro
- Paperspace Pro