Instance migration between two hosts is failing. I'm using OpenStack Ussuri. Both hosts have VMs running on them and are able to host new VMs.
Both hosts appear up and available in the compute service list:
darren@jacob:admin:~$ openstack compute service list
+--------------------------------------+----------------+--------+----------+---------+-------+----------------------------+
| ID | Binary | Host | Zone | Status | State | Updated At |
+--------------------------------------+----------------+--------+----------+---------+-------+----------------------------+
| 65640c54-641f-4cbf-91ba-dac39764ac31 | nova-scheduler | jacob | internal | enabled | up | 2021-01-15T23:57:22.000000 |
| 0aa0b80b-09e6-4e61-b222-dbf62b43ddda | nova-conductor | jacob | internal | enabled | up | 2021-01-15T23:57:26.000000 |
| f4dce946-94cf-482a-83d2-b32f1c7f87b5 | nova-compute | joseph | nova | enabled | up | 2021-01-15T23:57:19.000000 |
| 2b149fe0-9b9b-44b8-8d70-9fa5cf3b968b | nova-compute | judah | nova | enabled | up | 2021-01-15T23:57:27.000000 |
+--------------------------------------+----------------+--------+----------+---------+-------+----------------------------+
Here is an excerpt of the error from /var/log/nova/nova-conductor.log on the controller:
2021-01-15 15:39:43.263 30830 ERROR nova.conductor.tasks.migrate [req-a62d9ff4-be8b-4870-81a4-ebaf1c85ce37 993afae9dd9746b48f72fcafd974aef7 e98eaaf8e7ff403cb1180e9e29148890 - default default] [instance: 001f9cad-25ca-4f2d-b32c-01953d854dc5] Unable to find record for source node joseph.mcgrandle.com on joseph: nova.exception.ComputeHostNotFound: Compute host joseph could not be found.
2021-01-15 15:39:43.263 30830 WARNING nova.scheduler.utils [req-a62d9ff4-be8b-4870-81a4-ebaf1c85ce37 993afae9dd9746b48f72fcafd974aef7 e98eaaf8e7ff403cb1180e9e29148890 - default default] Failed to compute_task_migrate_server: Compute host joseph could not be found.: nova.exception.ComputeHostNotFound: Compute host joseph could not be found.
2021-01-15 15:39:43.264 30830 WARNING nova.scheduler.utils [req-a62d9ff4-be8b-4870-81a4-ebaf1c85ce37 993afae9dd9746b48f72fcafd974aef7 e98eaaf8e7ff403cb1180e9e29148890 - default default] [instance: 001f9cad-25ca-4f2d-b32c-01953d854dc5] Setting instance to ACTIVE state.: nova.exception.ComputeHostNotFound: Compute host joseph could not be found.
2021-01-15 15:39:43.318 30830 ERROR oslo_messaging.rpc.server [req-a62d9ff4-be8b-4870-81a4-ebaf1c85ce37 993afae9dd9746b48f72fcafd974aef7 e98eaaf8e7ff403cb1180e9e29148890 - default default] Exception during message handling: nova.exception.ComputeHostNotFound: Compute host joseph could not be found.
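The first ERROR line names the source node as joseph.mcgrandle.com but the host as joseph. To check whether the same pairing shows up for other hosts, here is a rough sketch that pulls the node and host names out of the conductor log. The function name missing_nodes is my own, and it assumes every such message has the same format as the line above.

```shell
#!/bin/sh
# Print the unique "node host" pairs that nova-conductor failed to look up.
# Assumes the message format "Unable to find record for source node <node> on <host>:".
# Usage: missing_nodes /var/log/nova/nova-conductor.log
missing_nodes() {
    sed -n 's/.*Unable to find record for source node \([^ ]*\) on \([^ :]*\):.*/\1 \2/p' "$1" | sort -u
}
```

Running it against the controller log prints one line per distinct node/host pair, which makes it easy to see whether the node name is always the fully qualified name while the host is the short name.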
I've tried re-populating the nova database with:
# su -s /bin/sh -c "nova-manage db sync" nova
I've also tried re-discovering the compute hosts:
# su -s /bin/sh -c "nova-manage cell_v2 discover_hosts --verbose" nova
But neither appears to make any difference. Thanks for any pointers or help.
Update: here is the output of the requested commands:
darren@jacob:admin:~$ sudo nova-manage cell_v2 list_hosts
+-----------+--------------------------------------+----------+
| Cell Name | Cell UUID | Hostname |
+-----------+--------------------------------------+----------+
| cell1 | 9095885b-466f-41d4-9c85-45b5af7b5ce2 | joseph |
| cell1 | 9095885b-466f-41d4-9c85-45b5af7b5ce2 | judah |
| cell1 | 9095885b-466f-41d4-9c85-45b5af7b5ce2 | reuben |
+-----------+--------------------------------------+----------+
darren@jacob:admin:~$ sudo nova-manage cell_v2 list_cells
+-------+--------------------------------------+-------------------------------------+--------------------------------------------+----------+
| Name | UUID | Transport URL | Database Connection | Disabled |
+-------+--------------------------------------+-------------------------------------+--------------------------------------------+----------+
| cell0 | 00000000-0000-0000-0000-000000000000 | none:/ | mysql+pymysql://nova:****@jacob/nova_cell0 | False |
| cell1 | 9095885b-466f-41d4-9c85-45b5af7b5ce2 | rabbit://openstack:****@jacob:5672/ | mysql+pymysql://nova:****@jacob/nova | False |
+-------+--------------------------------------+-------------------------------------+--------------------------------------------+----------+
And here is the updated compute service list output after adding reuben:
darren@jacob:admin:~$ openstack compute service list
+--------------------------------------+----------------+--------+----------+---------+-------+----------------------------+
| ID | Binary | Host | Zone | Status | State | Updated At |
+--------------------------------------+----------------+--------+----------+---------+-------+----------------------------+
| 65640c54-641f-4cbf-91ba-dac39764ac31 | nova-scheduler | jacob | internal | enabled | up | 2021-01-26T08:04:07.000000 |
| 0aa0b80b-09e6-4e61-b222-dbf62b43ddda | nova-conductor | jacob | internal | enabled | up | 2021-01-26T08:04:08.000000 |
| f4dce946-94cf-482a-83d2-b32f1c7f87b5 | nova-compute | joseph | nova | enabled | up | 2021-01-26T08:04:09.000000 |
| 2b149fe0-9b9b-44b8-8d70-9fa5cf3b968b | nova-compute | judah | nova | enabled | up | 2021-01-26T08:04:08.000000 |
| d306fe4f-1d12-41b7-a2c9-8f856247268b | nova-compute | reuben | nova | enabled | up | 2021-01-26T08:04:15.000000 |
+--------------------------------------+----------------+--------+----------+---------+-------+----------------------------+