Latest revision as of 23:46, 27 March 2026

人類如何勝過AI

人類如何勝過AI？
以 Anthropic 為例，工程師如今不再從零撰寫程式碼，而是將任務交給 Claude 生成初稿，人類角色則轉為提出需求、審查程式碼與把關整體架構，更接近產品經理或系統架構師。

Understand AI

Distill was built to fix a broken system: unreadable research papers.

Prompts

Claude

Prompting best practices, Claude 4.6 提示詞指南：掌握 6 大原則讓 AI 變更強

原則一：把 AI 當成「很聰明的新同事」
原則二：直接給範例，比空講規則更有效
原則三：說「要做什麼」，不要只說「不要做什麼」
原則四：處理長文件時，把資料放前面、問題放最後
原則五：新版模型更聰明，也更「主動」
原則六：想要更深度推理，可以要求「先自我檢查」

Prompt mistakes

No role assignment: Act as a XXX
Asking for too much at once: First, outline the XXX. Then we'll tackle each one.
Not specifying what to avoid: Avoid XXX
Not providing examples: Share a paragraph you want and say "Write in this style."

Research prompts

Learning prompts

ChatGPT Role-Playing Prompts
- New report shows the truth of how people actually use AI chatbots
- ChatGPT Role-Playing Prompts: A Guide for Editors

How to learn to code FAST using ChatGPT
- Give me a study plan to learn python for data science
- Give me a study plan to learn python for data science with resources and a timeline
- Sublime is used
- (After ask a question and get an answer). Let's take this step by step.

Ask generative AI to be that colleague. Ask 'As a physicist, describe how cancer cells interact with their environment', or 'As a chemist..', 'As a developmental biologist..', 'As an economist..' 'As an electrician.' ...

I Tried Thousands of ChatGPT Prompts, and These 4 Saved Me Hours (No BS)
- Learn any skill, fast: I want to learn about [insert topic]. Tell me the most critical 20% that helps me understand 80%
- Find the best resources: Give me the best learning resources (books, videos, podcasts, courses) for [topic]. Make sure they fit different learning styles, especially visual learners
- Simplify anything you don't understand: Explain [concept] like I'm 5
- Perform like a pro in any profession: Act as a world-class copywriter with 10+ years of experience. Help me write a blog post about [topic]

Creating images

5 of these 10 photos are AI-generated — can you spot them?

Interesting prompts

Can you tell me everything you know about me, based on our past conversations?

Carbon footprint

Free AI isn’t sustainable — and we’ll be paying for it soon enough.

ChatGPT

https://chat.openai.com, https://openai.com/blog/chatgpt-plus/

https://openai.com/blog/chatgpt/
Sign Up to OpenAI without Your Phone Number | OpenAI SMS Verification,
https://beta.openai.com/overview, examples
Tip: use SHIFT + ENTER to add a line break for entering some code.
This AI chatbot is dominating social media with its frighteningly good essays 2022/12/5
how it actually works?
ChatGPT is incredibly limited, but good enough at some things to create a misleading impression of greatness
Plagiarism is an outdated concept for AI
The End of High-School English
chatGPT google extension -A browser extension to display ChatGPT response alongside search engine results
GPT-1 to GPT-4: Each of OpenAI's GPT Models Explained and Compared
9 Communities for Beginners to Learn About AI Tools
NIH-CIT. ChatGPT 101, ChatGPT 102

Down

Network error

Network recommendations for ChatGPT errors on web and apps

Differences among platforms

8 ChatGPT Features You Can't Access on All Platforms

Settings

Skills

Codex

Plugins

How to Enable ChatGPT’s Web Browsing and Plugins

Use

Presentation/Powerpoint
- The 7 Best Tools That Use AI to Make Presentations for You
- https://mindshow.fun/ 快速演示你的想法 Auto-generated Slides
- How to Create a PowerPoint Using AI
- Adobe's new tool might just replace PowerPoint

Learn a language
- How ChatGPT Plus Can Help You Learn a Language
- How to Use ChatGPT's Conversational Mode to Practice a Language

Some examples:
- Andrew Ng
- Bioinformatics
- ChatGPT can Create Datasets, Program in R… and when it makes an Error it can Fix that too!
- It can also update code to more modern syntax as long as the syntax predates April 2021
- http://rtutor.ai/ which is built based on R's openai package.
  - Bioinformatics analysis using Chatlize and ChatGPT by Pr. Steven Ge | Tunis R User Group |Workshop 3
- 15 Creative Ways to Use ChatGPT by OpenAI
- Setting up macOS as an R data science rig in 2023
- Describe 20 possible generative AI use cases in detail across society that could create early impact.

Live voice

7 Interesting Ways You Can Use ChatGPT's Live Voice and Vision

Reasoning

How I Know When to Use ChatGPT Search vs. ChatGPT Reasoning

Deep research

API, Extension tools

https://platform.openai.com/account/api-keys, usage
A Complete Guide to the ChatGPT API
ChatBox, 开源的 ChatGPT API 跨平台桌面客户端，Prompt 的调试与管理工具，实现 ChatGPT Plus 的免费平替
Merlin ChatGPT Assistant for all Websites
ChatGPT Writer - Write mail, messages with AI
Tokens
- OpenAI Tokenizer & ChatGPT still can't answer this simple question
- What Is the ChatGPT Token Limit and Can You Exceed It?
- https://openai.com/api/pricing/. 0.04 - 2¢ per 1k tokens/words (language models). You can think of tokens as pieces of words, where 1,000 tokens is about 750 words. How much does the OpenAI’s cost per session? About $0.05 to $0.1, if you send 25 to 50 requests. $18.00 free trial. Cost depends on how many words in the input and output at $0.000002 per word.
- Unofficial Guide to OpenAI API Keys

Create your GPT

How I Stopped Procrastinating with ChatGPT — 2 Hours Saved Each Day!

call from R

Call ChatGPT (or really any other API) from R
openai-This R package provides an SDK to the Open AI API
openai package from CRAN & github
askgpt package
Shiny on Hugging Face
chattr package -Interact with Large Language Models in 'RStudio'.
ollamar package
- Use R to prompt a local LLM with ollamar

ellmer.

Setting up local LLMs for R and Python 2025/8/19
I tested it on Manjaro OS VM with 4GB ram and 4 cpu.

sudo pacman -S openssh
sudo systemctl start sshd
sudo systemctl enable sshd

sudo pacman -S lapack blas
sudo pacman -Sy r
sudo pacman -S base-devel

R

install.packages("ellmer", repos = "https://cloud.r-project.org")
library(ellmer)
chat <- chat_ollama(model = "llama3.2:1b")
chat$chat("Tell me a joke")
live_console(chat) # seems not working on ollama

Connect to LM Studio for local hosting
Harnessing Azure OpenAI and R for Web Content Summarisation: A Practical Guide with rvest and tidyverse
Three experiments in LLM code assist with RStudio and Positron
The Modern R Stack for Production AI
Chat with LLMs on your R environment. Google allows to use an api key for free.

call from Python

Jupyter-ai

A generative AI extension for JupyterLab

GPT-4

GPT o1

What Is ChatGPT's o1 Model and How Can You Use It?

GPT-4o

Alternatives

The 3 Best Alternatives to ChatGPT 1/6/2023
7 ChatGPT AI Alternatives (Free and Paid) 3/1/2023

PDF

chatPDF. ChatPDF allows you to use it for free with 3 PDFs every day, each up to 120 pages. It seems I can use chatPDF as usual chatGPT after I upload/paste URL of a pdf file without signing in my account.
pdfGPT
6 ChatGPT Apps to Analyze and Chat With Your Documents and PDFs
7 AI Tools That Answer Questions From Your PDFs
4 Ways AI Can Make Working With PDFs Easier
5 AI Tools to Analyze PDFs For Free. ChatGPT, Claude, Perplexity AI, Copilot, HuggingChat.
biorecap: an R package for summarizing bioRxiv preprints with a local LLM
You Can Now Chat With Your PDFs in Google Drive—Here’s How

Word

How to Automate Your Document Creation With ChatGPT in Microsoft Word
OnlyOffice AI assistants
- AI Document Editing: Connect GPT4All to ONLYOFFICE on Ubuntu
- It works. I tested on a Ubuntu Mate 24.04.1 VM (4 Host CPUs) and 8GB RAM. I am using the Llama 3.2 3B instruct model. The AI settings allow us to possibly select different models for different tasks: 1) Ask AI, 2) Summarization, 3) Translation, 4) Text analysis. We can also select to rewrite differently or make the text longer or shorter.
- Note that the original text could be overwritten by AI.
- Unlocking the Power of AI in ONLYOFFICE
- ONLYOFFICE + LocalAI: AI Document Editing Setup on Ubuntu

Government

Research

https://elicit.org/. Trained on scientific papers, and not only biomedical ones (providing real references!)
- How to use Elicit for topics that have lots of research
The 6 Best AI Tools for Researchers and Teachers
- Research Rabbit
- Gradescope
- Education Copilot
- ReadCube Papers
- Consensus
- Elicit
Scispace https://typeset.io/
Fake citations?, WebChatGPT: ChatGPT with internet access.
- To get references to journal articles, add "site:scholar.google.com" before your question.
- You can do the same exercise with another scholarly database like PubMed. site:pubmed.ncbi.nlm.nih.gov What are the major causes of lung cancer?

Content writer

7 Responsible Ways to Use AI as a Content Writer or Editor
How to Use ChatGPT to Transform Writing Into Another Format
- Turning A Blog Post Into a YouTube Script
- Changing a Technical Document Into a Popular Article
- Turning a Short Story Into a Movie Script
How to Write a Great Essay With ChatGPT Without Cheating

Meeting notes

Otter AI

Detect AI text

Youtube summary

Chrome extension YouTube Summary with ChatGPT from 8 AI-Powered Chrome Extensions to Summarize YouTube Videos

AutoGPT

How to Download and Install Auto-GPT Step-by-Step

Other chats

4 AI Search Engines I Use Every Day. Perplexity, Exa, You AI, Andi AI.
20 AI Websites That Feel Illegal to Use in 2026
- Perplexity: AI search engine that provides instant answers with sources.
- Leonardo AI: Image studio for creating product photos, characters, and 3D textures.
- Gamma: Tool for turning ideas into slide presentations automatically.
- ElevenLabs: AI voice generator for cloning voices and text-to-speech.
- Recraft: Professional vector-quality image generator for logos and icons.
- ChatGPT: Versatile AI for coding, writing, and image recognition.
- Sora: Hollywood-level AI video generator.
- Runway Gen-2: Advanced AI video editing and cinematography tool.
- Claude: Reasoning-focused AI for long documents and research.
- BlackBox AI: Coding assistant that debugs and explains code.
- Canva Magic Studio: AI-powered design tool for rewriting and editing images.
- Tome: Tool for creating interactive presentations and stories.
- Vrew: AI-powered video editing with auto-subtitles and cuts.
- HeyGen: Talking avatar generator with high lip-sync accuracy.
- Descript: Video and audio editor that works like a text document.
- Midjourney: AI art generator for unique visual styles and creativity.
- Pika Labs: AI animation specialist for short movement and transitions.
- OpusClip: Tool for turning long videos into viral social media shorts.
- Glide AI: No-code app builder for creating mobile and CRM tools.
- Luma AI: Tool for turning photos into photorealistic 3D scenes.

Claude

https://claude.ai/login, https://claude.ai/chat/. Like ChatGPT, there is no internet access.
For free plan, you can use it for a few question in a certain time period.
You can upload an image and ask it to give a story.
Claude vs. ChatGPT: Which LLM Is Best for Everyday Tasks?
I've Ditched ChatGPT for This Superior Alternative: 3 Reasons Why
I Can't Stop Watching This AI Chatbot Play Pokémon
Skills

Claude Code

Best Practices for Claude Code

Claude Artifacts

Claude Artifacts vs ChatGPT Canvas: Which Is Better?

Google Gemini

Bard now helps you code 4/21/2023
You Can Now Connect Bard to Gmail, Google Docs, YouTube, and More
Google's Newest AI Tool Helps You Choose Your Perfect Career
Google Has Dropped the Paywall for These Gemini Features 3/13/2025
- Gems
- Deep Research
- Gemini 2.0 Flash model
Google Gemini Can Now Turn Almost Anything Into a Podcast
Google 放大招！ Gemini 2.5 Pro 震撼發布，程式碼能力太強了，完爆 Claude 3.7 ？免費實測效果！

Google AI Studio

https://aistudio.google.com/

Google AI

NotebookLM

NotebookLM is Google's tool for building Retrieval-Augmented Generation (RAG) systems without coding.
I'm actually reading my read-it-later list thanks to this brilliant NotebookLM trick

New Features in NotebookLM
- Featured Notebooks: Expert-created templates that showcase best practices and help users learn how to build their own notebooks.
- Discover Sources: A new button that suggests curated, high-quality sources (e.g., from universities or news outlets) to enrich your notebook.
- Quizzes: Automatically generated quizzes based on your sources, with instant feedback and customizable difficulty, topic, and language.
- Flashcards: 60 default cards for memorizing key concepts, with options to customize and request explanations.
- Mindmaps: Interactive visual summaries of your sources, showing branching relationships between concepts. Not yet editable, but shareable.
- Audio Overview: Create podcasts in multiple styles and languages, with prompts to guide topic focus and length.
- Video Overview: Generate slide-based videos from your sources, structured into chapters and customizable by topic and language.
- Slide deck: they are static images. The complete "Editable NotebookLM Slides" solution. DeckEdit 目前最強 NotebookLM 簡報轉可編輯 PPTX、PDF 的免費工具.
How RAG Works
- Query Encoding: The user's question is converted into a vector (a mathematical representation).
- Document Retrieval: The system searches a database or document set for the most relevant matches.
- Context Injection: Retrieved documents are inserted into the model’s prompt.
- Response Generation: The model uses both its training and the retrieved context to generate a response.
Why RAG Is Useful
- Up-to-date answers: It can pull in current or domain-specific info not included in the model’s training.
- Custom knowledge bases: You can feed it your own documents (e.g., PDFs, research papers, manuals).
- No retraining needed: It improves accuracy without modifying the model itself.
Example Use Cases
- Scientific research assistants (like in phosphoproteomics 🧬)
  - Ask the question: What are three recurring ideas throughout these texts/documents
- Customer support bots using internal documentation
- Legal or medical AI tools referencing case files or journals

Illuminate

https://illuminate.google.com/ Transform your content into engaging AI‑generated audio discussions
If you like NotebookLM, you're going to love this new Google tool. Its primary focus seems to be converting research papers into five-minute audio discussions. Despite being built around the idea of converting research papers into Audio Overviews, Illuminate, in hindsight, lets you convert any web content into audio discussions as long as it isn't paywalled, from a site that has opted out of indexing, or contains content that violates Illuminate's safety filters.

Opal

I built my own app using Gemini and it's easier than Antigravity

Microsoft Copilot

Microsoft Copilot Tips and Tricks to Boost Your Productivity

perplexity.ai

Perplexity Assistant

Can't Afford ChatGPT Operator? Try Perplexity Assistant Instead

Multiple AI Chatbots

Poe
Monica
Perplexity

Grok

https://grok.com/. Designed by xAI.

Groq

https://groq.com/. https://console.groq.com/home is blocked.
Wikipedia

Deepseek

DeepSeek R1 vs V3: A Head-to-Head Comparison of Two AI Models
DeepSeek Coder: Let the Code Write Itself
Building DeepSeek R1 from Scratch Using Python

Qwen

https://chat.qwen.ai/

文心一言

https://yiyan.baidu.com/

Duck.ai

https://duck.ai

Proton Lumo

Brave AI chatbot: Leo

Everything You Need to Know About Leo: Brave Browser’s AI Chatbot

You.com

You.com’s AI-infused Google rival provides a tantalizing glimpse of the future

Mistral/Le Chat

Trae AI

Open source chats

HuggingChat
- GPT4All now supports 100+ more models! Sideloading any ggML model
- HuggingChat vs. Bing Chat: Which Is the Better ChatGPT Alternative?
Run Your Own AI Chat GPT-3 On Your Computer llama
1 Command To Bring Home Your Very Own Robot Overlord - Text Generation Webui
- oobabooga
How to Run a Large Language Model on Linux (and Why You Should)
- dalai

Export, import

You can finally bring your ChatGPT and Claude chats to Gemini

Run locally

List of Different Ways to Run LLMs Locally
4 free AI chatbots you can run directly on your PC
Anyone Can Enjoy the Benefits of a Local LLM With These 5 Apps
- Ollama
- Msty
- AnythingLLM
- Jan.ai
- LM Studio

Hardware guide

The Complete Guide to Local LLM Hardware: Specs for Running AI Models on Consumer Hardware
- A 7 billion-parameter model in full 16-bit precision requires 14 GB. The same model at 8-bit quantisation needs 7 GB. At 4-bit, roughly 3.5 GB.
- For consumer hardware, 4-bit quantisation is the standard for anything larger than 7 billion parameters.
- RTX 3060 12GB 7-10t/s, RTX 4060 Ti (16GB) 12-15t/s, RTX 4090 (24GB) 20-30t/s

Models

Stop guessing which local LLMs run on your PC—this open-source tool can tell you

Jan.ai

Jan, https://github.com/janhq/jan
ChatGPT 最佳免费替代软件！支持本地离线运行，100%免费开源，兼容多种主流AI大模型！.
Llama 3 正式发布！性能强悍，支持AI文生图，完全免费开源！附本地安装教程！
https://jan.ai/docs/local-api. Local Server Address: By default, Jan is only accessible on the same computer it's running on, using the address 127.0.0.1. You can change this to 0.0.0.0 to let other devices on your local network access it. However, this is less secure than allowing access from the same computer.

LM Studio

LM Studio.
- LM Studio Link. I Turned My 16GB Mac Mini Into an AI Powerhouse — Here's How LM Studio Link Changed Everything
Run local LLMs with ease on Mac and Windows thanks to LM Studio
Llama3 一键本地部署！无需GPU ！100% 保证成功，轻松体验 Meta 最新的 8B、70B AI大模型！
R
- Summarising Top 100 UK Climbs: Running Local Language Models with LM Studio and R
How to run DeepSeek and other LLMs locally on your Mac
6 ways anyone can use LM Studio and a local LLM on their PC
I don’t need Perplexity anymore because my local LLM does it better
- 8 GB RTX 4060, 16GB LPDDR5X memory
- quantized to 4-bit precision -> 25 to 30 tokens per second
- Llama 3.1 70B and DeepSeek R1 distilled models -> get GPT-4 performance
- Qwen 2.5 instance generates 25 to 30 tokens per second, which is almost half of what cloud-based GPT-4 can spit out
- Live web search is what you'll miss the most
Plugin
- DuckDuckGo plugin for web search.

Anything LLM

https://anythingllm.com/
https://github.com/Mintplex-Labs/anything-llm
AnythingLLM supports Qualcomm Hexagon NPU on Qualcomm Snapdragon X systems. The great NPU failure: Two years later, local AI is still all about GPUs
DeepSeek-R1最佳本地用法！免费开源，无痛运行高级 AI 大模型，秒建私人知识库
How to build your own AI bot to answer questions about your documents

Msty

https://msty.app/
How to build your own AI bot to answer questions about your documents
- Anything LLM took 10 to 15 minutes to embed a PDF file with around 150 pages in the test. Msty, on the other hand, often took three to four times as long.

Ollama

https://github.com/ollama/ollama
- FAQ like How do I configure Ollama server? Environment="OLLAMA_HOST=0.0.0.0"
- For example when I try Llama 3.2 1B model on 4GB (now I extend it to 8GB) Manjaro VM using 4 vCPU, the total memory including the xfce desktop is 2.27G.
Issue: Did not get a response.
- If it took too long, I can use Ctrl+C to stop.
- Even after I quit ollama, a "ollama runner" process is still running. So I run "ps -ef | grep ollama". We can use ollama stop MODEL_NAME. See How do I keep a model loaded in memory or make it unload immediately? in FAQ.

My notes. llama3.1:8b is better than Phi3/Phi4 (14b).

$ ollama list
NAME               ID              SIZE      MODIFIED    
qwen2:1.5b         f6daf2b25194    934 MB    6 days ago     
phi3:3.8b          4f2222927938    2.2 GB    6 days ago     
llama3.1:8b        46e0c10c039e    4.9 GB    6 days ago     
llama3.2:latest    a80c4f17acd5    2.0 GB    6 days ago     
llama3.2:1b        baf6a787fdff    1.3 GB    2 weeks ago

$ ollama pull llama3.1:8b

$ ollama run --verbose qwen2:1.5b
>>> what is lincoln memorial
...
total duration:       1m14.068603383s
load duration:        19.23796ms
prompt eval count:    13 token(s)
prompt eval duration: 2.348s
prompt eval rate:     5.54 tokens/s
eval count:           297 token(s)
eval duration:        1m11.699s
eval rate:            4.14 tokens/s
>>> /bye

$ ollama run --verbose phi3:3.8b
>>> what is lincoln memorial
...
total duration:       1m33.270810903s
load duration:        14.566152ms
prompt eval count:    15 token(s)
prompt eval duration: 7.383s
prompt eval rate:     2.03 tokens/s
eval count:           160 token(s)
eval duration:        1m25.872s
eval rate:            1.86 tokens/s
>>> /bye

Ollama Guidance for Effective Use
Vision:
- Llama 3.2 Vision
- How to Run Llama 3.2-Vision Locally With Ollama: A Game Changer for Edge AI
If you want to change the default location where Ollama saves its models, you can set the OLLAMA_MODELS environment variable to your desired directory. To do this:
- Open a terminal
- Run: sudo systemctl edit ollama.service
- Add the following line under the [Service] section & Save and exit the editor: Environment="OLLAMA_MODELS=/path/to/new/location"
- Reload the daemon: sudo systemctl daemon-reload
- Restart Ollama: sudo systemctl restart ollama
GPU:
- Running Ollama on Your Local Machine with NVIDIA GPUs
- Self-hosting Llama 3 on a home server
Model file
- https://github.com/ollama/ollama/blob/main/docs/modelfile.md A model file is the blueprint to create and share models with Ollama
Raspberry Pi 5:
- Running Open LLM Models on Raspberry Pi 5 with Ollama
- Run LLMs Locally on Raspberry Pi Using Ollama AI
Alpaca: A Linux GUI App to Manage Multiple AI Models Offline

Command line options

The 7 Ollama Commands That Separate Hobbyists From Power Users

Terminal

I turned my Linux terminal into a local AI assistant and it’s so useful

explain() {
input=$(cat)
ollama run llama3.2:3b "$* $input"
}

ask() {
ollama run llama3.2:3b "$*"
}

VS Code

Create your own and custom Copilot in VSCode with Ollama and CodeGPT
- CodeGPT Quick Start
Step-by-Step: Running DeepSeek locally in VSCode for a Powerful, Private AI Copilot
I built a local coding AI for VS Code and it’s shockingly good. LM Studio + continue extension.

OpenWebUI

(2025/7/31) Ollama desktop is now available. Ollama 0.10 Speeds up Local AI Models, Introduces Desktop App.
https://github.com/open-webui/open-webui
Mac
- Install Ollama
  - Download Ollama for Mac. After unzipping it, drag the file to the Application folder. Then double clicking the Ollama app to start the installation.
  - Command line way: ollama run --verbose llama3.2
- Install Open WebUI
```
$ brew install [email protected]
$ python3.11 -m venv ollamavenv
$ source ollamavenv/bin/activate
(ollamavenv) $ pip install open-webui
(ollamavenv) $ open-webui serve  # OR open-webui serve --port 8080
(ollamavenv) $ deactivate
```
  Create username, email (eg [email protected]) and password. There is no email verification, and it’s only stored locally, so the email is just an identifier for login. You’ll only need to log in again if you clear your browser cache or reset the database. The only thing that matters is: You remember the email + password you entered (you’ll need it to log in again later).
  Go to http://localhost:8080 to see the Open WebUI.
  The Ollama and llama3.2 was automatically recognized and ready to use.
Add a Non-Ollama Backend
- Go to Settings > Model Providers
- Click "Add Provider"
- Choose "OpenAI-compatible"
- Enter the base URL (e.g., http://localhost:1234/v1)
- Provide the API key (if needed) — for local setups you can use sk-fake-key
Docker. How I Built a Self-Hosted AI Server in 5 Minutes (And You Can Too!). Deploying Ollama with Open WebUI Locally: A Step-by-Step Guide.
```
docker run -d \
  -p 3000:8080 \
  -v ollama:/root/.ollama \
  -v open-webui:/app/backend/data \
  --name open-webui \
  --restart always \
  ghcr.io/open-webui/open-webui:ollama
```
- "ollama" and "open-webui" are volume names since they are not in a directory format. Some useful commands are: "docker volume ls", "docker volume inspect ollama". If you wanted the folders to appear in your current directory, you would use a bind mount, which specifies an absolute path to a folder on your host machine.
- To find each volume size, use docker system df -v
- To download a model, go to "Admin Panel" - Settings - Models. Click the download button (Manage models). In the "Pull a model from Ollama.com" field, enter a tag like "deepseek-r1:1.5b" or "gemma3:1b" or "gemma3:4b" and hit the "Pull model" button. Another way is to type a tag in the "select a model" (down arrow icon) field on the UI and let it pull from ollama.com. See https://ollama.com/library for available models. On my 8GB VM and 4 vcpus, the token response speed is 5.5/s.

Claude code

Python

I Built a Fully Offline AI Agent That Answers Questions From PDF, Images, and Audio — No Cloud…

Cherry Studio

https://www.cherry-ai.com/ (38k star). AI Agent + Coding Agent + 300+ assistants: agentic AI desktop with autonomous coding, intelligent automation, and unified access to frontier LLMs.
白嫖顶级AI模型！GLM-4.7 + MiniMax M2.1 免费 API

Locally

https://locallyai.app/ Run AI models locally on your iPhone, iPad, and Mac.

GPT4ALL

Phi-3 开源大模型本地部署！能否媲美 ChatGPT、Cladue 3？
Here's How To Install Your Own Uncensored Local GPT-Like Chatbot
It can download models from two sources: GPT4ALL and HuggingFace (no guarantee it will work).
My testing:
- A new directory "gpt4all" was created under the home directory. The GUI can be launched from the command line ~/gpt4all/bin/chat or from a desktop icon.
- Use my user account to install it, not to use 'sudo'. The installation will create a folder 'gpt4all' under my home directory.
- VM is not working if we use vCPU. It shows Encountered an error starting up: "Incompatible hardware detected." Unfortunately, your CPU does not meet the minimal requirements to run this program. In particular, it does not support AVX intrinsics which this program required to successfully run a modern large language model. ... The solution is to use edit the hardware to use host cpu.
When we launch GPT4ALL, it will check if a new version is available. If a new version is available, it will offer to upgrade.
Comparison of GPT4ALL, LM Studio and Ollama

Feature	GPT4ALL	LM Studio	Ollama
Model Compatibility	Vicuna, Alpaca, LLaMa	Wide range including Vicuna, Alpaca, LLaMa, Falcon, Starcoder, GPT-2	Various models, seamless workflow integration
User Interface	User-friendly GUI	More UI-friendly, in-app chat interface	Simple command-line interface, various web-based clients available
Performance	Good for lower-end systems	Generally faster inference, more coherent responses	Optimized for speed, rapid inference times
Resource Utilization	Efficient on consumer-grade hardware	May require more resources for larger models	Can be resource-intensive for larger models
Customization	Basic	Advanced (e.g., adjustable parameters)	Flexible, allows creating custom models
Acceleration Support	Not specified	CUDA, openCL, cuBLAS, Metal	Not specified
Open Source	Yes	No (free to download)	Yes
OS Support	Cross-platform	macOS, Windows (with AVX2), Linux (beta)	macOS, Linux, Windows (preview)
Key Features	RAG capabilities, wide hardware support	Built-in chat interfaces, OpenAI-like local servers	Simplicity, ease of installation, suitable for beginners
Developer Tools	Python bindings, API	Local inference server	Command-line interface, API

Remote access

Open-Webui

Images

Stop Paying for AI Images: These 4 Free Open-Source Tools are Better

Documents

Khoj

https://github.com/khoj-ai/khoj Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI
I started using a self-hostable AI research app and I should have sooner. NotebookLM alternative.

PrivateGPT

https://github.com/zylon-ai/private-gpt (56.7k star)

DocsGPT

https://github.com/arc53/DocsGPT (17.2k star)

Harper to replace grammarly

Models

Performance evaluation

Meta's LLaMA

Introducing Meta Llama 3: The most capable openly available LLM to date 2024/4/18
Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU! 2024/4/21. AirLLM. It’s not designed for real-time interactive scenarios like chatting, more suitable for data processing and other offline asynchronous scenarios.
Create a free Llama 3.1 405B-powered chatbot on a GitHub repo in <1 min

BERT

Build LLM

AI agent

OpenClaw

LangChain

Build context-aware reasoning applications

https://github.com/langchain-ai/langchain
https://en.wikipedia.org/wiki/LangChain
Introduction to LangChain
Generative AI with LangChain, RStudio, and just enough Python
OpenAI has introduced a file upload capability that allows users to upload various file types, such as PDFs, CSVs, and PowerPoint presentations, directly to their platform for analysis. However, for more advanced or customized applications, developers often turn to frameworks like LangChain. LangChain provides tools to parse and process different file types, integrate with large language models (LLMs), and build sophisticated workflows tailored to specific needs. For instance, LangChain offers document loaders to handle various file formats and chains to analyze documents in a structured manner.

AI Browser

List

ChatGPT Atlas
Comet browser from Perplexity
Opera Is the First Browser to Support Local AI LLMs
https://pinokio.co/ & Pinokio
Dia from Arc.
- I thought this AI browser was awful until one update flipped everything

Reviews

8 Best Agentic AI Browsers in 2025

AI, ML and DL

AI, ML and DL: What’s the Difference?

Applications

General Applications

人何時走完全未知？美研發AI預測臨終準確度達90％
美國FDA首次批准AI醫療儀器上市，能自動即時偵測糖尿病視網膜病變
在家養老-科技幫大忙
病理研究有新幫手，Google以AR顯微鏡結合深度學習即時發現癌細胞
This New App Is Like Shazam for Your Nature Photos. Seek App.
Draw This camera prints crappy drawings of the things you photograph (DIY) with Google's quickdraw.
What Are Machine Learning Algorithms? Here’s How They Work
How to Read Articles That Use Machine Learning Users’ Guides to the Medical Literature
Google的人工智慧開源神器三歲了，它被用在很多你想不到的地方 Nov 2018
What is Natural Language Processing and How Does It Work? NLP works via preprocessing the text and then running it through the machine learning-trained algorithm.
Why Machine Learning Cannot Ignore Maximum Likelihood Estimation van der Laan & Rose 2021

Coding/code

Antigravity. I built an E-Ink photo frame using an Arduino, E-Paper display and Google Antigravity

Images

Modify

I Tried Photoshop in ChatGPT, and It Went Better Than I Expected

Drawing

This New AI Tool Can Animate Your Children’s Drawings. Animated Drawing by Meta.
How to Build an Image Generator in React Using the DALL-E API
How to Use Bing Image Creator to Make AI Art
How To Install Stable Diffusion With Prompting Cheat Sheets 5/21/2023
8 DALL-E 3 Prompts for Your Next Image 2024/3/28
The 5 Best Open-Source AI Image Generators 2024/4/23
Using AI to Create Logos: The Pros, Cons, and Best Practices
重磅炸弹！Stable Diffusion 3 终于开源了！ 2024/7
Stable Diffusion 3 in R? Why not? Thanks to {reticulate} 2024/9/1
Run it locally
- Stable Diffusion web UI
Want Powerful Local AI Image Generation on Windows? Use This Tool 4/21/2024
https://github.com/lllyasviel/Fooocus
不止吉卜力！GPT-4o新玩法全網瘋傳，網友：AI成精了 convert this photo to studio ghibli style anime

Describe images

GeoSpy

https://geospy.web.app/

Videos

Music

Games

Write a complete Python game using only the standard pygame library (no external dependencies). The game should have a retro synthwave aesthetic with neon-like colors and a grid or night-sky background. The player controls a red square that can move left and right at the bottom of the screen using the arrow keys. Blue obstacles fall from the top of the screen, and the goal is to avoid them. Additionally, include the following features:
- Show a start screen with instructions, including:
  - “Press R to restart”
  - “Use Up/Down arrows to change obstacle speed”
- The red square should be smaller than the original version (e.g., 30x30 pixels).
- The game should be easy to play — falling speeds should start slow and obstacles should be spaced out enough for beginners.
- Players should be able to press the Up arrow to increase obstacle falling speed and the Down arrow to decrease it.
- The game should run smoothly with no errors, and the code should be fully self-contained in a single file.

Text to/from speech

文字轉語音、語音轉文字！這幾種方法你最好要知道

ChatTTS 最强文本转语音！一键本地安装，100%成功！效果逼真如真人，完全免费开源！
- ChatTTS 本地部署教程！目前最好用的文字转语音工具！
- ChatTTS-ui
- MY GPU is 4GB. By default, GPU is not used. I am using the docker compose method. Following the instruction at 安装了CUDA，为什么还是为CPU呢？ #106, I just need to open ChatTTS/core.py and on line 78, change "4096" to "2048". Bingo! Verify by nvidia-smi -l 1.

Whisper
- Running Whisper AI for Real-Time Speech-to-Text on Linux

OpenAI Whisper ASR Box
- This local voice-to-text app replaced every paid service for me

IndexTTS in github
- IndexTTS Voice Cloning and TTS in 4GB VRAM! (Local Test & Install)

Chatterbox TTS - SoTA open-source TTS
- I cloned my voice with a local voice model and the result was unsettlingly good

dia
- Machine Learning in Linux: Dia – 1.6B parameter text to speech model

Bioinformatics

AI and statistics

What are the most important statistical ideas of the past 50 years

Four Deep Learning Papers to Read in June 2021

FDA Elsa

FDA推出通用AI工具Elsa，可望加速審查流程

Neural network

Types of artificial neural networks

https://en.wikipedia.org/wiki/Types_of_artificial_neural_networks

neuralnet package

nnet package

sauron package

Explaining predictions of Convolutional Neural Networks with 'sauron' package

OneR package

So, what is AI really?

h2o package

https://cran.r-project.org/web/packages/h2o/index.html

shinyML package

shinyML - Compare Supervised Machine Learning Models Using Shiny App

LSBoost

Explainable 'AI' using Gradient Boosted randomized networks Pt2 (the Lasso)

LightGBM/Light Gradient Boosting Machine

Survival data

Simulated neural network

Simulated Neural Network with Bootstrapping Time Series Data

Languages for machine learning

GitHub: The top 10 programming languages for machine learning

Keras (high level library)

Keras is a model-level library, providing high-level building blocks for developing deep-learning models. It doesn’t handle low-level operations such as tensor manipulation and differentiation. Instead, it relies on a specialized, well-optimized tensor library to do so, serving as the backend engine of Keras.

Currently, the three existing backend implementations are the TensorFlow backend, the Theano backend, and the Microsoft Cognitive Toolkit (CNTK) backend.

On Ubuntu, we can install required packages by

$ sudo apt-get install build-essential cmake git unzip \
                  pkg-config libopenblas-dev liblapack-dev
$ sudo apt-get install python-numpy python-scipy python- matplotlib python-yaml
$ sudo apt-get install libhdf5-serial-dev python-h5py
$ sudo apt-get install graphviz
$ sudo pip install pydot-ng
$ sudo apt-get install python-opencv

$ sudo pip install tensorflow  # CPU only
$ sudo pip install tensorflow-gpu # GPU support

$ sudo pip install theano

$ sudo pip install keras
$ python -c "import keras; print keras.__version__"
$ sudo pip install --upgrade keras  $ Upgrade Keras

To configure the backend of Keras, see Introduction to Python Deep Learning with Keras.

Example 1: DeepDecon.

Model Definition: In train_model.py, model = Sequential() defines a neural network model using the Keras Sequential API. It adds several dense (fully connected) layers with dropout for regularization. The activation function is set to ReLU for hidden layers and sigmoid or softmax for the output layer, depending on the number of output classes.
Model Compilation: self.model.compile(loss=self.loss, optimizer=self.optimizer, metrics=[rmse, 'mse', metrics.mae]) compiles the model, specifying the loss function, optimizer, and evaluation metrics. The custom RMSE function is included as one of the metrics.
Model Training: history = self.model.fit(X_tr, y_tr, batch_size=self.batch_size, epochs=self.epochs, validation_data=validation_data, callbacks=callbacks, shuffle=True, verbose=verbose) trains the model on the training data (X_tr, y_tr) with specified batch size and number of epochs. It also uses validation data for early stopping if enabled.
Early Stopping: if self.early_stopping sets up early stopping to prevent overfitting by monitoring the validation loss and stopping training if it doesn’t improve for a specified number of epochs.

In the eval.py code,

Loading Models models = {} This section loads pre-trained models from specified paths and stores them in a dictionary. The custom RMSE function is used during model loading.
Calculating Differences def get_difference() This function calculates the differences between the true labels and the predicted labels. It returns the minimum and maximum differences, as well as the difference array.
Single Prediction: def get_single_prediction() This function performs a single prediction by iteratively refining the prediction interval until it stabilizes.
Batch Prediction: def get_prediction() This function performs predictions for a batch of input data by calling "get_single_prediction" for each input sample.
Main Function: This section sets up argument parsing, loads the test data, performs predictions, and saves the results to a specified file.

TensorFlow (backend library)

Basic

https://www.tensorflow.org/
- https://www.tensorflow.org/install/docker
https://tensorflow.rstudio.com/
- R interface to Keras. I followed the instruction for the installation but got an error of illegal operand. The solution is to use an older version of tensorflow; see here. library(keras); install_keras(tensorflow = "1.5") (Ubuntu 16.04, Phenom(tm) II X6 1055T)
- https://rviews.rstudio.com/2018/04/03/r-and-tensorflow-presentations/, Slides
- https://hub.docker.com/r/andrie/tensorflowr/, https://hub.docker.com/r/rocker/ml/dockerfile (outdated)
Deep Learning on Biowulf
Raspberry Pi
- How to Install Tensorflow on Raspberry Pi
Books
- Deep Learning with R by François Chollet with J. J. Allaire, 2018. ISBN-10: 161729554X (available on safaribooksonline)
- Deep Learning with Python by François Chollet, 2017 (available on safaribooksonline)
- Deep Learning by Ian Goodfellow and Yoshua Bengio and Aaron Courville
Enterprise Web Services with Neural Networks Using R and TensorFlow A docker image was created based on R 3.5.0 using R libraries from MRAN’s July 2nd, 2018 snapshot, as well as Miniconda 3 version 4.4.10 for python.
Deep Learning Glossary
- http://www.wildml.com/deep-learning-glossary/
- What is an epoch (related to batch) in deep learning?, Epoch vs Iteration when training neural networks. Example: if you have 1000 training examples, and your batch size is 500, then it will take 2 iterations to complete 1 epoch. Since "batch" depends on the partition of the entire samples, we need different partitions (epoches) in order to get an unbiased result.
Best Books to learn Tensorflow

Some terms

Machine Learning Glossary from developers.google.com

Tensor

Tensors for Neural Networks, Clearly Explained!!!

Dense layer and dropout layer

In Keras, what is a "dense" and a "dropout" layer?

Fully-connected layer (= dense layer). You can choose "relu" or "sigmoid" or "softmax" activation function.

Activation function

Artificial neural network -> Neural networks as functions [math]\displaystyle{ \textstyle f (x) = K \left(\sum_i w_i g_i(x)\right) }[/math] where K (commonly referred to as the activation function) is some predefined function, such as the hyperbolic tangent or sigmoid function or softmax function or rectifier function.
Rectifier/ReLU f(x) = max(0, x).
Sigmoid. Binary problem. Logistic function and hyperbolic tangent tanh(x) are two examples of sigmoid functions.
Softmax. Multiclass classification.

Backpropagation

https://en.wikipedia.org/wiki/Backpropagation

Convolutional network

https://en.wikipedia.org/wiki/Convolutional_neural_network

Deep Learning with Python

Jupyter notebooks for the code samples of the book "Deep Learning with Python"

sudo apt install python3-pip python3-dev

sudo apt install build-essential cmake git unzip \
   pkg-config libopenblas-dev liblapack-dev
sudo apt-get install python3-numpy python3-scipy python3-matplotlib \
   python3-yaml
sudo apt install libhdf5-serial-dev python3-h5py
sudo apt install graphviz
sudo pip3 install pydot-ng

# sudo apt-get install python-opencv
# https://stackoverflow.com/questions/37188623/ubuntu-how-to-install-opencv-for-python3
# https://askubuntu.com/questions/783956/how-to-install-opencv-3-1-for-python-3-5-on-ubuntu-16-04-lts

sudo pip3 install keras

Colorize black-and-white photos

Keras using R

R Markdown Notebooks for "Deep Learning with R"
R interface to Keras
Deep Neural Network in R
Python vs R
Derivative of a tensor operation: the gradient
- Define loss_value = f(W) = dot(W, x)
- W1 = W0 - step * gradient(f)(W0)
Stochastic gradient descent
Tensor operations:
- relu(x) = max(0, x)
- Each neural layer from our first network example transforms its input data:output = relu(dot(W, input) + b) where W and b are the weights or trainable parameters of the layer.

Training process:

Draw a batch of X and Y
Run the network on x (a step called the forward pass) to obtain predictions y_pred.
- How many layers to use.
- How many “hidden units” to chose for each layer.
Compute the loss of the network on the batch
- loss
- optimizer: determines how learning proceeds (how the network will be updated based on the loss function). It implements a specific variant of stochastic gradient descent (SGD).
- metrics
Update all weights of the network in a way that slightly reduces the loss on this batch.
- batch_size
- epochs (=iteration over all samples in a batch_size of samples)

Keras (in order to use Keras, you need to install TensorFlow or CNTK or Theano):

Define your training data: input tensors and target tensors.

Define a network of layers (or model). Two ways to define a model:

using the keras_model_sequential() function (only for linear stacks of layers, which is the most common network architecture by far) or

model <- keras_model_sequential() %>%
  layer_dense(units = 32, input_shape = c(784)) %>%
  layer_dense(units = 10, activation = "softmax")

the functional API (for directed acyclic graphs of layers, which let you build completely arbitrary architectures)

input_tensor <- layer_input(shape = c(784))

output_tensor <- input_tensor %>%
  layer_dense(units = 32, activation = "relu") %>%
  layer_dense(units = 10, activation = "softmax")

model <- keras_model(inputs = input_tensor, outputs = output_tensor)

Compile the learning process by choosing a loss function, an optimizer, and some metrics to monitor.

model %>% compile(
  optimizer = optimizer_rmsprop(lr = 0.0001),
  loss = "mse",
  metrics = c("accuracy")
)

Iterate on your training data by calling the fit() method of your model.

model %>% fit(input_tensor, target_tensor, batch_size = 128, epochs = 10)

Custom loss function

Custom Loss functions for Deep Learning: Predicting Home Values with Keras for R

Metrics

https://machinelearningmastery.com/custom-metrics-deep-learning-keras-python/

Docker RStudio IDE

Assume we are using rocker/rstudio IDE, we need to install some packages first in the OS.

$ docker run -d -p 8787:8787 -e USER=XXX -e PASSWORD=XXX --name rstudio rocker/rstudio

$ docker exec -it rstudio bash
# apt update
# apt install python-pip python-dev
# pip install virtualenv

And then in R,

install.packages("keras")
library(keras)
install_keras(tensorflow = "1.5")

Use your own Dockerfile

Data Science for Startups: Containers Building reproducible setups for machine learning

Some examples

See Tensorflow for R from RStudio for several examples.

Binary data (Chapter 3.4)

The final layer will use a sigmoid activation so as to output a probability (a score between 0 and 1, indicating how likely the sample is to have the target “1”.
A relu (rectified linear unit) is a function meant to zero-out negative values, while a sigmoid “squashes” arbitrary values into the [0, 1] interval, thus outputting something that can be interpreted as a probability.

library(keras)
imdb <- dataset_imdb(num_words = 10000)
c(c(train_data, train_labels), c(test_data, test_labels)) %<-% imdb

# Preparing the data
vectorize_sequences <- function(sequences, dimension = 10000) {...}
x_train <- vectorize_sequences(train_data)
x_test <- vectorize_sequences(test_data)
y_train <- as.numeric(train_labels)
y_test <- as.numeric(test_labels)

# Build the network
## Two intermediate layers with 16 hidden units each
## The final layer will output the scalar prediction
model <- keras_model_sequential() %>% 
  layer_dense(units = 16, activation = "relu", input_shape = c(10000)) %>% 
  layer_dense(units = 16, activation = "relu") %>% 
  layer_dense(units = 1, activation = "sigmoid")
model %>% compile(
  optimizer = "rmsprop",
  loss = "binary_crossentropy",
  metrics = c("accuracy")
)
model %>% fit(x_train, y_train, epochs = 4, batch_size = 512)
## Error in py_call_impl(callable, dots$args, dots$keywords) : MemoryError: 
## 10.3GB memory is necessary on my 16GB machine

# Validation
results <- model %>% evaluate(x_test, y_test)

# Prediction on new data
model %>% predict(x_test[1:10,])

Multi class data (Chapter 3.5)

Goal: build a network to classify Reuters newswires into 46 different mutually-exclusive topics.
You end the network with a dense layer of size 46. This means for each input sample, the network will output a 46-dimensional vector. Each entry in this vector (each dimension) will encode a different output class.
The last layer uses a softmax activation. You saw this pattern in the MNIST example. It means the network will output a probability distribution over the 46 different output classes: that is, for every input sample, the network will produce a 46-dimensional output vector, where outputi is the probability that the sample belongs to class i. The 46 scores will sum to 1.

library(keras)
reuters <- dataset_reuters(num_words = 10000)
c(c(train_data, train_labels), c(test_data, test_labels)) %<-% reuters

model <- keras_model_sequential() %>% 
  layer_dense(units = 64, activation = "relu", input_shape = c(10000)) %>% 
  layer_dense(units = 64, activation = "relu") %>% 
  layer_dense(units = 46, activation = "softmax")
model %>% compile(
  optimizer = "rmsprop",
  loss = "categorical_crossentropy",
  metrics = c("accuracy")
)
history <- model %>% fit(
  partial_x_train,
  partial_y_train,
  epochs = 9,
  batch_size = 512,
  validation_data = list(x_val, y_val)
)
results <- model %>% evaluate(x_test, one_hot_test_labels)
# Prediction on new data
predictions <- model %>% predict(x_test)

MNIST dataset.

Regression data (Chapter 3.6)

Because so few samples are available, we will be using a very small network with two hidden layers. In general, the less training data you have, the worse overfitting will be, and using a small network is one way to mitigate overfitting.
Our network ends with a single unit, and no activation (i.e. it will be linear layer). This is a typical setup for scalar regression (i.e. regression where we are trying to predict a single continuous value). Applying an activation function would constrain the range that the output can take. Here, because the last layer is purely linear, the network is free to learn to predict values in any range.
We are also monitoring a new metric during training: mae. This stands for Mean Absolute Error.

library(keras)
dataset <- dataset_boston_housing()
c(c(train_data, train_targets), c(test_data, test_targets)) %<-% dataset

build_model <- function() {
  model <- keras_model_sequential() %>% 
    layer_dense(units = 64, activation = "relu", 
                input_shape = dim(train_data)[[2]]) %>% 
    layer_dense(units = 64, activation = "relu") %>% 
    layer_dense(units = 1) 
    
  model %>% compile(
    optimizer = "rmsprop", 
    loss = "mse", 
    metrics = c("mae")
  )
}
# K-fold CV
k <- 4
indices <- sample(1:nrow(train_data))
folds <- cut(1:length(indices), breaks = k, labels = FALSE) 
num_epochs <- 100
all_scores <- c()
for (i in 1:k) {
  cat("processing fold #", i, "\n")
  # Prepare the validation data: data from partition # k
  val_indices <- which(folds == i, arr.ind = TRUE) 
  val_data <- train_data[val_indices,]
  val_targets <- train_targets[val_indices]
  
  # Prepare the training data: data from all other partitions
  partial_train_data <- train_data[-val_indices,]
  partial_train_targets <- train_targets[-val_indices]
  
  # Build the Keras model (already compiled)
  model <- build_model()
  
  # Train the model (in silent mode, verbose=0)
  model %>% fit(partial_train_data, partial_train_targets,
                epochs = num_epochs, batch_size = 1, verbose = 0)
                
  # Evaluate the model on the validation data
  results <- model %>% evaluate(val_data, val_targets, verbose = 0)
  all_scores <- c(all_scores, results$mean_absolute_error)
}

PyTorch

An R Shiny app to recognize flower species

Google Cloud Platform

Choosing between TensorFlow/Keras, BigQuery ML, and AutoML Natural Language for text classification Comparing text classification done three ways on Google Cloud Platform

Amazon

Amazon's Machine Learning University is making its online courses available to the public

Workshops

Notebooks from the Practical AI Workshop 2019

OpenML.org

R interface to OpenML.org

Biology

Predicting Splicing from Primary Sequence with Deep Learning Jaganathan et al 2018
Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks Li et al BMC Bioinformatics 2019
DL 101: Basic introduction to deep learning with its application in biomedical related fields 2022