VM Management Troubleshooting | Common Issues and Solutions
This guide helps you resolve common issues encountered when using Nife VM Management.
General Troubleshooting Steps#
Before diving into specific issues, try these general troubleshooting steps:
Refresh the Dashboard
- Click the refresh button to reload instance data
- Clear browser cache (Ctrl+Shift+Delete)
- Reload the page (F5 or Ctrl+R)
Verify Credentials
- Confirm credentials are still valid in your cloud provider
- Check keys haven't been rotated or revoked
- Verify IAM permissions are still assigned
Check Cloud Provider Status
- Visit your cloud provider's status page
- AWS: https://status.aws.amazon.com
- GCP: https://status.cloud.google.com
- Azure: https://status.azure.com
Review Error Messages
- Read error messages carefully for specific details
- Screenshot errors for support tickets
- Check browser console (F12) for JavaScript errors
Wait and Retry
- Some operations are asynchronous
- Wait 1-2 minutes for changes to propagate
- Retry the operation
Creating VMs Issues#
"Organization not found" Error#
Cause: Organization doesn't exist or isn't accessible
Solutions:
- Refresh the page to reload organizations
- Verify you have access to the organization
- Ask organization admin to grant access
- Create a new organization if needed
- Contact Nife support if issue persists
"Cloud provider type is required" Error#
Cause: No cloud provider selected
Solutions:
- Select a cloud provider from the dropdown
- Choose between AWS, GCP, or Azure
- Don't leave it blank
- Ensure dropdown opened correctly
"Instance name is required" Error#
Cause: Instance name field is empty
Solutions:
- Enter a descriptive instance name
- Use lowercase letters, numbers, and hyphens
- Name should be 3-50 characters
- Make name unique within organization
File Upload Issues (GCP)#
"Please enter an instance name before uploading the file"#
Cause: Service account file upload attempted without instance name
Solutions:
- First, fill in the Instance Name field
- Then select the service account JSON file
- Click Upload
"Please select a valid JSON file"#
Cause: Selected file is not in JSON format
Solutions:
- Verify you selected the correct file from GCP
- File should be named something like
project-id-xxxxx.json - Don't rename the file
- Try downloading the key again from GCP
- Check file extension is
.json
"Failed to upload file"#
Cause: Network issue, invalid file, or server error
Solutions:
- Check internet connection
- Try uploading again
- Refresh the page and retry
- Use a different browser
- Try smaller file size (though it shouldn't matter)
- Check file isn't corrupted
- Contact support if error persists
AWS Credential Validation Issues#
"Invalid Access Key ID"#
Cause: Access Key ID is incorrect or no longer valid
Solutions:
- Verify Access Key ID starts with
AKIA - Copy key exactly from AWS IAM console
- Check key hasn't been rotated
- Ensure key hasn't been disabled
- Verify IAM user is active
- Create new access key if needed
To create new AWS access key:
- Log into AWS console
- Go to IAM โ Users โ Your user
- Click Security credentials
- Click Create access key
- Select CLI usage
- Copy new Access Key ID
- Save Secret Access Key securely
"Invalid Secret Access Key"#
Cause: Secret key is incorrect or invalid
Solutions:
- Secret key is only visible once when created
- If lost, create a new access key
- Check for extra spaces or characters
- Verify secret key matches Access Key ID
- Use password manager to store securely
- Deactivate old key if regenerating
"Instance not found in AWS"#
Cause: Instance doesn't exist or wrong region specified
Solutions:
- Verify instance exists in AWS console
- Check instance ID format: should be
i-xxxxxxxxxxxx - Verify you're using the correct region
- Confirm instance isn't terminated
- Check instance is in correct AWS account
- Go to EC2 โ Instances and verify
"Access Denied / Permission Denied"#
Cause: IAM user doesn't have EC2 permissions
Solutions:
- Verify user has
AmazonEC2FullAccesspolicy - Or verify custom policy grants necessary permissions
- Check policy is attached to user
- Wait 1-2 minutes for IAM changes to take effect
- Sign out and back into AWS to refresh
- Create new access key after policy attachment
GCP Credential Issues#
"Service account is invalid"#
Cause: Service account key file is invalid or incorrect
Solutions:
- Download fresh service account key from GCP
- Verify file is valid JSON
- Check file contains all required fields
- Don't manually edit the JSON file
- Try uploading again
- Use a different browser
"Compute Engine API not enabled"#
Cause: Compute Engine API isn't enabled in GCP project
Solutions:
- Go to GCP Console
- Navigate to APIs & Services โ Library
- Search for "Compute Engine API"
- Click Enable
- Wait 2-3 minutes for activation
- Retry VM creation
"Instance not found in GCP"#
Cause: Instance doesn't exist or wrong zone specified
Solutions:
- Verify instance exists in GCP console
- Go to Compute Engine โ VM instances
- Verify correct zone (e.g.,
us-central1-a) - Check instance name spelling
- Ensure instance isn't deleted
- Verify project is correct
"Service account lacks permissions"#
Cause: Service account missing necessary GCP roles
Solutions:
- Go to GCP Console
- Navigate to IAM & Admin โ IAM
- Find your service account
- Click Edit
- Grant "Compute Instance Admin (v1)" role
- Wait 1-2 minutes for permissions to apply
- Retry operation
Azure Credential Issues#
"Invalid Subscription ID"#
Cause: Subscription ID is incorrect or doesn't exist
Solutions:
- Go to Azure portal
- Click Subscriptions
- Copy exact Subscription ID (GUID)
- Verify subscription is active
- Check you have access to subscription
- Avoid extra spaces when copying
"Resource Group not found"#
Cause: Resource group doesn't exist in subscription
Solutions:
- Go to Azure portal
- Click Resource groups
- Find and verify resource group name
- Check spelling exactly
- Confirm resource group in correct subscription
- Verify you have access to resource group
"Invalid Client ID / Tenant ID"#
Cause: Service principal credentials are incorrect
Solutions:
- Go to Azure AD โ App registrations
- Click your app
- Copy exact Application (client) ID
- Go to App registration Properties
- Copy exact Directory (tenant) ID
- Check GUIDs don't have extra spaces
- Create new credentials if needed
"Client secret invalid or expired"#
Cause: Client secret is wrong, expired, or no longer exists
Solutions:
- Go to Azure AD โ App registrations โ Your app
- Click Certificates & secrets
- Check if secret has expired (date shown)
- Create new client secret if expired
- Copy Value (not ID) of secret
- Immediately save secret securely (won't show again)
- Delete old expired secret
"Insufficient permissions for operation"#
Cause: Service principal lacks required permissions
Solutions:
- Go to Azure portal
- Click Subscriptions โ Your subscription
- Click Access Control (IAM)
- Click Add โ Add role assignment
- Select "Virtual Machine Contributor"
- Find and select your service principal
- Click Review + assign
- Wait 1-2 minutes for permissions to propagate
Managing VMs Issues#
"Instance not accessible" / "Cannot connect"#
Symptoms#
- Instance shows as Running in dashboard
- Cannot connect via SSH or console
- Connection timeout errors
Solutions#
For AWS:
- Go to AWS EC2 console
- Check Security Group rules
- Ensure port 22 (SSH) is allowed
- Check inbound rules allow your IP
- Verify instance has public IP
- Try accessing from AWS console first
For GCP:
- Go to GCP Compute Engine
- Click on instance
- Click Edit
- Check Firewall rules
- Ensure ssh tag is applied
- Check VPC firewall rules allow SSH
- Try gcloud ssh command first
For Azure:
- Go to Azure portal
- Find Virtual Machine
- Check Network Interface
- Verify Network Security Group (NSG)
- Ensure inbound rule allows port 22
- Check your IP isn't blocked
"Operation timed out"#
Cause: Operation took longer than expected
Solutions:
- Wait 1-2 minutes and refresh
- Check instance status in cloud provider console
- Retry the operation
- Check network connectivity
- Try different browser
- Check for service maintenance windows
Instance Won't Start#
Symptoms:
- Start button disabled or grayed out
- Error when clicking start
- Instance remains in Stopped state
Solutions:
- Check instance status in cloud provider console
- Verify cloud provider doesn't have issues
- Check for startup errors in instance logs
- Verify disk space available
- Check instance type limitations
- Try force stopping, then restarting
- Contact cloud provider support if persists
Instance Won't Stop#
Symptoms:
- Stop operation hangs
- Instance remains Running
- Timeout errors
Solutions:
- Wait longer (stop can take minutes)
- Force stop from cloud provider console
- Check for services preventing shutdown
- Check instance logs for errors
- Try restart instead
- Contact cloud provider support
Cannot Delete Instance#
Symptoms:
- Delete operation fails with error
- Instance still exists after deletion attempt
Solutions:
- Verify instance has no dependencies
- Detach storage volumes first
- Remove from load balancers
- Cancel any ongoing operations
- Try deleting from cloud provider console
- Check for resource locks
- Verify proper permissions
"Insufficient permissions" Error#
Cause: Your Nife account lacks permissions for this organization
Solutions:
- Ask organization admin to grant access
- Verify your role in organization
- Check if you're in correct organization
- Ask for VM management permissions specifically
- Create test instance in accessible organization
Monitoring Issues#
No Metrics Displayed#
Cause: Instance too new, metrics not available, or monitoring disabled
Solutions:
- Wait 5-10 minutes for metrics to populate
- Check instance is in Running state
- Verify instance has been running >5 minutes
- Refresh the monitoring page
- Check browser console for errors
- Try different browser
Metrics Stopped Updating#
Cause: Instance offline, monitoring service issue, or connectivity problem
Solutions:
- Check instance status
- Refresh the monitoring page
- Stop and restart instance
- Check cloud provider is operational
- Wait 5 minutes and retry
- Clear browser cache
- Contact support if persists
Alerts Not Sending#
Cause: Notification method misconfigured or issue with service
Solutions:
- Verify alert threshold settings
- Check notification email/Slack is correct
- Verify notification service is enabled
- Check spam folder if email alert
- Recreate alert from scratch
- Test with manual alert trigger
- Contact support if issue continues
Export Issues#
Export File Is Empty#
Cause: No instances to export or query failed
Solutions:
- Verify you have VM instances created
- Check instances aren't filtered out
- Refresh instance list first
- Try again after refreshing page
- Check browser console for errors
Export File Is Corrupted#
Cause: Network issue during download or browser issue
Solutions:
- Try exporting again
- Use different browser
- Check available disk space
- Try JSON format instead of CSV (or vice versa)
- Check browser's download settings
CSV File Not Readable in Excel#
Cause: Encoding issue or Excel import problem
Solutions:
- Open in Google Sheets (usually works better)
- Import CSV with UTF-8 encoding
- Use Excel's Text Import Wizard
- Try opening with different application
- Export as JSON and convert
Performance Issues#
Dashboard Loads Slowly#
Cause: Too many instances, network issue, or browser performance
Solutions:
- Use filters to reduce visible instances
- Close other browser tabs
- Clear browser cache
- Refresh the page
- Try different browser
- Disable extensions
- Check internet connection
Operations Are Slow#
Cause: Cloud provider API delays, network latency, or service issues
Solutions:
- Wait for current operation to complete
- Check cloud provider status
- Try again after 1-2 minutes
- Verify network connection
- Try from different location
- Contact Nife support if recurring
Getting Help#
When to Contact Support#
Contact Nife support if:
- Issue persists after troubleshooting
- Error message unclear or unusual
- Multiple operations failing
- Data loss or security concern
- Performance issues across multiple operations
Providing Support Information#
When contacting support, include:
- Error message: Exact text or screenshot
- Steps to reproduce: How you got the error
- Instance details: Name, provider, organization
- Timing: When did the issue start
- Environment: Browser, OS, network
- Recent changes: What changed before issue
- Screenshots: Of error or issue
Support Channels#
- Email: [email protected]
- Chat: Live chat in dashboard
- Documentation: Check docs first
- Status Page: Check service status
FAQ#
Q: How long do instance operations take? A: Start/stop: 1-5 minutes, Restart: 2-10 minutes, Create: 5-20 minutes
Q: Why can't I see my instance? A: Check filters, search query, or organization selection
Q: Do I lose data when stopping an instance? A: No, stopping preserves data. Only deleting removes data.
Q: Can I undo a deletion? A: No. Deletions are permanent. Restore from backups/snapshots if available.
Q: How often are metrics updated? A: Metrics update every 1-5 minutes depending on metric type.
Next Steps#
- Managing VM Instances - Manage your instances
- Cloud Provider Setup - Configure providers
- Monitoring VMs - Monitor performance