| Summary: | Documentation: Controller replacement guide, Delete the failed node from MongoDB section: need to add a note about finding the IP to connect to. | ||
|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | Alexander Chuzhoy <sasha> |
| Component: | documentation | Assignee: | Dan Macpherson <dmacpher> |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Alexander Chuzhoy <sasha> |
| Severity: | unspecified | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | unspecified | CC: | mcornea, sasha, srevivo |
| Target Milestone: | ga | Keywords: | Documentation, ZStream |
| Target Release: | 8.0 (Liberty) | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2016-06-16 04:41:08 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
|
Description
Alexander Chuzhoy
2016-04-15 16:11:10 UTC
Two things in "Delete the failed node from MongoDB" section:
1) The IPs (shown in the output and used for connection) in the guide differ:
sudo netstat -tulnp | grep 27017
tcp 0 0 192.168.201.47:27017 0.0.0.0:* LISTEN 13415/mongod
[heat-admin@overcloud-controller-0 ~]$ mongo --host 192.168.0.47
2) I was unable to delete the node when connected to secondary:
{
"_id" : 1,
"name" : "10.19.94.13:27017",
"health" : 0,
"state" : 8,
"stateStr" : "(not reachable/healthy)",
"uptime" : 0,
"optime" : Timestamp(1462580916, 5),
"optimeDate" : ISODate("2016-05-07T00:28:36Z"),
"lastHeartbeat" : ISODate("2016-05-07T03:55:16Z"),
"lastHeartbeatRecv" : ISODate("2016-05-07T00:33:26Z"),
"pingMs" : 0,
"syncingTo" : "10.19.94.16:27017"
},
{
"_id" : 2,
"name" : "10.19.94.14:27017",
"health" : 1,
"state" : 2,
"stateStr" : "SECONDARY",
"uptime" : 18603,
"optime" : Timestamp(1462593277, 6),
"optimeDate" : ISODate("2016-05-07T03:54:37Z"),
"self" : true
}
],
"ok" : 1
}
tripleo:SECONDARY> rs.remove('10.19.94.13:27017')
{
"ok" : 0,
"errmsg" : "replSetReconfig command must be sent to the current replica set primary."
}
tripleo:SECONDARY>
Note:
I was able to delete it (although with errors shown) when I connected to primary:
{
"_id" : 1,
"name" : "10.19.94.13:27017",
"health" : 0,
"state" : 8,
"stateStr" : "(not reachable/healthy)",
"uptime" : 0,
"optime" : Timestamp(1462580916, 5),
"optimeDate" : ISODate("2016-05-07T00:28:36Z"),
"lastHeartbeat" : ISODate("2016-05-07T03:57:01Z"),
"lastHeartbeatRecv" : ISODate("2016-05-07T00:33:26Z"),
"pingMs" : 0,
"syncingTo" : "10.19.94.16:27017"
},
{
"_id" : 2,
"name" : "10.19.94.14:27017",
"health" : 1,
"state" : 2,
"stateStr" : "SECONDARY",
"uptime" : 18687,
"optime" : Timestamp(1462593277, 6),
"optimeDate" : ISODate("2016-05-07T03:54:37Z"),
"lastHeartbeat" : ISODate("2016-05-07T03:57:06Z"),
"lastHeartbeatRecv" : ISODate("2016-05-07T03:57:06Z"),
"pingMs" : 0,
"syncingTo" : "10.19.94.16:27017"
}
],
"ok" : 1
}
tripleo:PRIMARY> rs.remove('10.19.94.13:27017')
2016-05-07T03:57:19.541+0000 DBClientCursor::init call() failed
2016-05-07T03:57:19.543+0000 Error: error doing query: failed at src/mongo/shell/query.js:81
2016-05-07T03:57:19.545+0000 trying reconnect to 10.19.94.16:27017 (10.19.94.16) failed
2016-05-07T03:57:19.547+0000 reconnect 10.19.94.16:27017 (10.19.94.16) ok
(In reply to Alexander Chuzhoy from comment #3) > Two things in "Delete the failed node from MongoDB" section: > 1) The IPs (shown in the output and used for connection) in the guide differ: > sudo netstat -tulnp | grep 27017 > tcp 0 0 192.168.201.47:27017 0.0.0.0:* LISTEN > 13415/mongod > [heat-admin@overcloud-controller-0 ~]$ mongo --host 192.168.0.47 > > Added fix for this. > tripleo:PRIMARY> rs.remove('10.19.94.13:27017') > 2016-05-07T03:57:19.541+0000 DBClientCursor::init call() failed > 2016-05-07T03:57:19.543+0000 Error: error doing query: failed at > src/mongo/shell/query.js:81 > 2016-05-07T03:57:19.545+0000 trying reconnect to 10.19.94.16:27017 > (10.19.94.16) failed > 2016-05-07T03:57:19.547+0000 reconnect 10.19.94.16:27017 (10.19.94.16) ok This error should be normal and should be what's expected. Might be a good idea to add a note on this item. As for the PRIMARY vs SECONDARY issue, I'll add a note for that too. Something along the lings of: ===IMPORTANT=== You must run the command against the PRIMARY replica set. If you see the following message: "replSetReconfig command must be sent to the current replica set primary." Relog into MongoDB on the node designated as PRIMARY. Verified. This section of the doc looks good. Changes now live on the customer portal. |