Troubleshooting¶
Instance stuck in BUILD status¶
The possible reasons for this issue:
Networking issue between message queue service(e.g. RabbitMQ) and Trove guest instance. When creating a Trove instance, trove-taskmanager sends a message to the message queue, which is expected to be received and handled by the trove-guestagent service which is running inside the instance. The instance status should be updated by trove-guestagent service after handling. Apparently, If the trove-guestagent can’t connect with RabbitMQ, the instance status won’t be updated.
Code bug in trove-guestagent. You should be able to see some error log in trove-guestagent log file (by default,
/var/log/trove/trove-guestagent.log
).If you are using the dev mode image, it’s also possible that trove-guestagent can’t connect to Trove controller host to download trove-guestagent service code, either because of network connectivity issue or the ssh key is missing or incorrect.
In either case, you will need to ssh into the Trove instance and check the log. In order to do that, you need to make sure:
You have the admin credentials to get the IP address of Trove instance management port. The management port is a Neutron port allocated from the management network (defined by
management_networks
config option). For example, you need to log into a Trove instance named ‘test’, and the management network name is ‘trove-mgmt’, you can run:$ openstack server list | grep test | grep trove-mgmt | 810fc014-bd9f-4464-b506-1b78f37c495e | test | ACTIVE | private=10.1.0.57; trove-mgmt=192.168.254.229 | ubuntu-xenial-mysql-5.7-dev | ds1G |
The TCP 22 port is allowed in the Neutron security group (defined by
management_security_groups
config option) that applied to the management port.You have the SSH private key. The Trove instance should be created using a Nova keypair defined by
nova_keypair
config option.
After log into the instance, you can check the trove-guestagent log by:
sudo journalctl -u guest-agent.service | less # or
sudo vi /var/log/trove/trove-guestagent.log
Please contact Trove team in #openstack-trove IRC channel or send email to openstack-discuss@lists.openstack.org if help needed.
Note
The Trove instance creation time varies in different environments, the
default value of usage_timeout
option (3600 seconds) may not be applied
to all, the cloud administrator should change that based on testing so that
the instance creation should fail in a reasonable timely manner.