Installation & Administrator GuideΒΆ
- Introduction
- Required and Recommended Components
- Initial Installation of Scyld ClusterWare
- Common Additional Configuration and Software
- Configure Hostname
- Choosing An Alternate Database
- Couchbase auto-failover
- Configure Authentication
- Disable/Enable Chain Booting
- Installing Optional ClusterWare Software
- Job Schedulers
- Kubernetes
- scyld-nss Name Service Switch (NSS) Tool
- Firewall Configuration
- Install OpenMPI, MPICH, and/or MVAPICH
- Configure IP Forwarding
- Install Additional Tools
- Node Images and Boot Configurations
- Compute Node Fields
- Compute Nodes IPMI access
- Boot Configurations
- Creating PXEboot Images
- Recreating the Default Image
- Modifying PXEboot Images
- Capturing and Importing PXEboot Images
- Deleting unused images and boot configurations
- Copying boot configurations between head nodes
- Wrapper scripts
- Adding 3rd-party software
- Using Kickstart
- Using RHCOS
- Booting Diskful Compute Nodes
- Interacting with Compute Nodes
- Securing the Cluster
- Monitoring the Status of the Cluster
- Graphical Interface
- Managing Multiple Head Nodes
- Managing Node Failures
- Managing Large Clusters
- Backup and Restore
- Updating Scyld ClusterWare
- Troubleshooting ClusterWare
- Failing PXE Network Boot
- Kickstart Failing
- Head Node Filesystem Is 100% Full
- Exceeding System Limit of Network Connections
- etcd Database Exceeds Size Limit
- Failing To Boot From Local Storage
- IP Forwarding
- Soft Power Control Failures
- Head Nodes Disagree About Compute Node State
- Finding Further Information
- Contacting Penguin Computing Support
- IPMI
- Services, Ports, Protocols