claude-memory-storai/project_dgx.md at main

Files

Claude Code f3949880a1 Auto-sync memory 2026-04-17_21:55

2026-04-17 21:55:00 +03:00

name, description, type, originSessionId

name	description	type	originSessionId
DGX Spark Nodes	dgxmain and dgxsec - DGX Spark GB10 ARM nodes with full AI stack, Ollama, LiteLLM, credentials	project	f6f70bf9-8d74-459e-a0c1-f9f2333923d4

dgxmain (10.100.102.241, Tailscale 100.124.217.84)

ARM Cortex-X925+A725, 20 cores, 122GB RAM, NVIDIA GB10 GPU
3.6TB NVMe (29% used)
SSH: yohay / Biton24680#@ (login), Biton24680#@ (sudo)
Ollama: running natively (systemd), port 11434
Models: nemotron-3-super:120b, gpt-oss:120b, llama3:70b, deepseek-r1:70b, qwen3:32b, gemma3:27b, devstral:23.6b, gemma3:4b, nomic-embed-text
LiteLLM proxy: port 4000, key sk-dgxmain-litellm-2026
Docker containers: Dify stack, Open WebUI, Gitea, n8n, Portainer, Caddy, WordPress, ArangoDB, TensorRT-LLM, Kali

Why: These are the most powerful compute nodes in the cluster. Not in K8s due to ARM compatibility issues.

How to apply: Use standalone Docker Compose for GPU workloads. LiteLLM on dgxmain is the unified inference API for all services.