Files
proxmox/docs/archive/historical/REMAINING_STEPS.md
defiQUG cb47cce074 Complete markdown files cleanup and organization
- Organized 252 files across project
- Root directory: 187 → 2 files (98.9% reduction)
- Moved configuration guides to docs/04-configuration/
- Moved troubleshooting guides to docs/09-troubleshooting/
- Moved quick start guides to docs/01-getting-started/
- Moved reports to reports/ directory
- Archived temporary files
- Generated comprehensive reports and documentation
- Created maintenance scripts and guides

All files organized according to established standards.
2026-01-06 01:46:25 -08:00

330 lines
7.1 KiB
Markdown

# Remaining Steps - Proxmox VE Configuration
**Date:** 2025-01-20
**Status:** Critical tasks complete, optional steps remaining
---
## ✅ Completed Tasks
1.**Hostname Migration**
- r630-01: `pve``r630-01`
- r630-02: `pve2``r630-02`
2.**IP Address Audit**
- 34 VMs/containers scanned
- 0 IP conflicts ✅
- All IPs documented ✅
3.**Storage Configuration**
- r630-01: thin1 (200GB) + local-lvm (200GB) enabled ✅
- r630-02: thin1 (113GB available) + thin2-thin6 enabled ✅
- Total: ~2.4TB+ available ✅
---
## ⚠️ HIGH PRIORITY - Remaining Steps
### 1. Update Cluster Configuration ⚠️ CRITICAL
**Issue:** Cluster still shows old hostnames (`pve`, `pve2`) instead of new hostnames (`r630-01`, `r630-02`)
**Current Status:**
```
Cluster nodes:
- Node 1: ml110 ✅
- Node 2: pve (should be r630-01) ⚠️
- Node 3: pve2 (should be r630-02) ⚠️
```
**Action Required:**
```bash
# Option 1: Update cluster node names (if supported)
pvecm updatecerts -f
# May require cluster reconfiguration
# Option 2: Verify if hostname changes are sufficient
# Cluster may auto-update on next quorum change
```
**Verification:**
```bash
pvecm status
pvecm nodes
# Should show r630-01 and r630-02
```
**Impact:** Cluster operations may reference old hostnames, could cause confusion
---
### 2. Verify VMs on r630-02 Storage ⚠️ RECOMMENDED
**Issue:** Storage shows VMs exist (VMIDs: 100, 101, 102, 103, 104, 105, 130, 5000, 6200 on thin1, VMID 7800 on thin4), but `pct list` and `qm list` show nothing
**Action Required:**
```bash
ssh root@192.168.11.12
# Check all storage for VMs
pvesm list thin1
pvesm list thin4
# Check if VMs are registered
pct list
qm list
# Check VM configurations
ls -la /etc/pve/nodes/r630-02/lxc/
ls -la /etc/pve/nodes/r630-02/qemu-server/
# Check if VMs are on different node
pvesh get /nodes --output-format json | jq
```
**Possible Issues:**
- VMs may be registered on different node (ml110 or r630-01)
- VMs may be orphaned (storage exists but not registered)
- VMs may need to be re-registered
**Impact:** Need to understand VM status before starting new VMs
---
### 3. Test Storage Performance ⚠️ RECOMMENDED
**Action Required:**
```bash
# On r630-01
ssh root@192.168.11.11
# Create test container
pct create 9999 local:vztmpl/ubuntu-22.04-standard_22.04-1_amd64.tar.zst \
--storage local-lvm --hostname test-storage --net0 name=eth0,bridge=vmbr0
# Test performance
# Delete test container
pct destroy 9999
# On r630-02
ssh root@192.168.11.12
# Create test container
pct create 9999 local:vztmpl/ubuntu-22.04-standard_22.04-1_amd64.tar.zst \
--storage thin1 --hostname test-storage --net0 name=eth0,bridge=vmbr0
# Test performance
# Delete test container
pct destroy 9999
```
**Purpose:** Verify storage is working correctly before deploying production VMs
---
## 📋 OPTIONAL - Optimization Steps
### 4. Distribute VMs Across Hosts 📋 RECOMMENDED
**Current State:** All 34 VMs on ml110 (overloaded)
**Recommended Distribution:**
- **ml110:** Keep 10-15 lightweight/management VMs
- **r630-01:** Migrate 10-15 medium workload VMs
- **r630-02:** Migrate 10-15 heavy workload VMs (best CPU - 56 cores)
**Migration Commands:**
```bash
# From ml110 to r630-01
pct migrate <VMID> r630-01 --storage local-lvm
# From ml110 to r630-02
pct migrate <VMID> r630-02 --storage thin2 # or thin3, thin5, thin6
```
**Benefits:**
- Better performance (ml110 CPU is slower)
- Better resource utilization
- Improved redundancy
**Estimated Time:** 1-2 hours (depending on VM sizes)
---
### 5. Implement Storage Monitoring 📋 RECOMMENDED
**Action Required:**
```bash
# Set up storage alerts (manual or via monitoring system)
# Monitor:
# - Storage usage >80%
# - Thin pool metadata usage
# - Storage growth trends
```
**Tools:**
- Proxmox built-in monitoring
- External monitoring (Prometheus, Grafana)
- Custom scripts
**Purpose:** Proactive alerting before storage issues occur
---
### 6. Security Hardening 📋 RECOMMENDED
**Actions:**
1. **Update Passwords**
```bash
# Change weak passwords on r630-01 and r630-02
passwd root
```
2. **SSH Key Authentication**
```bash
# Set up SSH keys instead of passwords
ssh-copy-id root@192.168.11.11
ssh-copy-id root@192.168.11.12
```
3. **Firewall Configuration**
```bash
# Review and configure firewall rules
# Restrict access where needed
```
4. **Access Control Review**
- Review user permissions
- Implement least privilege
- Audit access logs
---
### 7. Network Optimization 📋 OPTIONAL
**Actions:**
1. **VLAN Migration** (Planned)
- Segment network by service type
- Improve security
- Better traffic management
2. **Network Monitoring**
- Monitor bandwidth usage
- Track performance
- Alert on issues
---
### 8. Documentation Updates 📋 OPTIONAL
**Actions:**
1. Update any scripts/configs that reference old hostnames (`pve`, `pve2`)
2. Update documentation with new hostnames
3. Update inventory files if needed
**Search for references:**
```bash
grep -r "pve\|pve2" scripts/ config/ docs/ --exclude-dir=.git
```
---
## 🚀 Ready to Start VMs
### Pre-Start Checklist Status
- [x] Hostnames migrated ✅
- [x] IP addresses audited ✅
- [x] No IP conflicts ✅
- [x] Storage enabled on r630-01 ✅
- [x] Storage enabled on r630-02 ✅
- [x] Proxmox services operational ✅
- [ ] **Cluster configuration updated** ⚠️
- [ ] **VMs on r630-02 verified** ⚠️
- [ ] **Storage tested** ⚠️
### Can Start VMs Now?
**Yes, but recommended to:**
1. Update cluster configuration first (prevents confusion)
2. Verify r630-02 VMs (understand existing state)
3. Test storage (ensure it works)
**Critical blockers:** None - all critical tasks complete
---
## 📊 Priority Summary
### 🔴 CRITICAL (Do Before Production)
1. Update cluster configuration
2. Verify r630-02 VMs status
### ⚠️ HIGH PRIORITY (Recommended)
3. Test storage performance
4. Distribute VMs across hosts
### 📋 RECOMMENDED (For Optimization)
5. Implement monitoring
6. Security hardening
7. Network optimization
8. Documentation updates
---
## 🎯 Quick Action Plan
### Immediate (15-30 minutes)
1. Update cluster configuration
2. Verify r630-02 VMs
### Short-term (1-2 hours)
3. Test storage
4. Plan VM distribution
### Long-term (Ongoing)
5. Implement monitoring
6. Security hardening
7. Network optimization
---
## 📝 Commands Reference
### Cluster Management
```bash
# Check cluster status
pvecm status
# List nodes
pvecm nodes
# Update certificates (may help with hostname updates)
pvecm updatecerts -f
```
### VM Verification
```bash
# List all VMs on a node
pct list
qm list
# Check storage contents
pvesm list <storage-name>
# Check VM configurations
ls -la /etc/pve/nodes/<node>/lxc/
ls -la /etc/pve/nodes/<node>/qemu-server/
```
### Storage Testing
```bash
# Create test container
pct create 9999 local:vztmpl/ubuntu-22.04-standard_22.04-1_amd64.tar.zst \
--storage <storage-name> --hostname test
# Destroy test container
pct destroy 9999
```
---
**Last Updated:** 2025-01-20
**Status:** Critical tasks complete, optional steps available