Hide Forgot
* Observed when using the following spec file (on Milpitas cluster) 1: volume remote1 2: type protocol/client 3: option transport-type tcp 4: option remote-host n1 5: option remote-subvolume p3 6: end-volume * The following logs are repeated at approximately every 10 seconds: [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] remote1: DNS resolution failed on host n1 [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. Going offline until atleast one of them comes back up. [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] remote1: DNS resolution failed on host n1 [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. Going offline until atleast one of them comes back up. [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] remote2: DNS resolution failed on host n4 [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. Going offline until atleast one of them comes back up. [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] remote2: DNS resolution failed on host n4 [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. Going offline until atleast one of them comes back up. [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] remote1: DNS resolution failed on host n1 [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. Going offline until atleast one of them comes back up. [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] remote1: DNS resolution failed on host n1 [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. Going offline until atleast one of them comes back up. [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] remote2: DNS resolution failed on host n4 [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. Going offline until atleast one of them comes back up. [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] remote2: DNS resolution failed on host n4 [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. Going offline until atleast one of them comes back up. It would be better if we could print the error logs only once.
(In reply to comment #0) > * Observed when using the following spec file (on Milpitas cluster) > > 1: volume remote1 > 2: type protocol/client > 3: option transport-type tcp > 4: option remote-host n1 > 5: option remote-subvolume p3 > 6: end-volume > > > * The following logs are repeated at approximately every 10 seconds: > > [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: > getaddrinfo failed (Name or service not known) > [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] > remote1: DNS resolution failed on host n1 > [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. > Going offline until atleast one of them comes back up. > [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: > getaddrinfo failed (Name or service not known) > [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] > remote1: DNS resolution failed on host n1 > [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. > Going offline until atleast one of them comes back up. > [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: > getaddrinfo failed (Name or service not known) > [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] > remote2: DNS resolution failed on host n4 > [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. > Going offline until atleast one of them comes back up. > [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: > getaddrinfo failed (Name or service not known) > [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] > remote2: DNS resolution failed on host n4 > [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. > Going offline until atleast one of them comes back up. > [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: > getaddrinfo failed (Name or service not known) > [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] > remote1: DNS resolution failed on host n1 > [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. > Going offline until atleast one of them comes back up. > [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: > getaddrinfo failed (Name or service not known) > [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] > remote1: DNS resolution failed on host n1 > [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. > Going offline until atleast one of them comes back up. > [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: > getaddrinfo failed (Name or service not known) > [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] > remote2: DNS resolution failed on host n4 > [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. > Going offline until atleast one of them comes back up. > [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: > getaddrinfo failed (Name or service not known) > [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] > remote2: DNS resolution failed on host n4 > [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. > Going offline until atleast one of them comes back up. > > > > > It would be better if we could print the error logs only once. The log messages are printed while trying to re-connect. we can suppress the logs to make it print at one place, either at gf_resolve or af_inet_client_get_remote_sockaddr.
(In reply to comment #1) > (In reply to comment #0) > > * Observed when using the following spec file (on Milpitas cluster) > > > > 1: volume remote1 > > 2: type protocol/client > > 3: option transport-type tcp > > 4: option remote-host n1 > > 5: option remote-subvolume p3 > > 6: end-volume > > > > > > * The following logs are repeated at approximately every 10 seconds: > > > > [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: > > getaddrinfo failed (Name or service not known) > > [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] > > remote1: DNS resolution failed on host n1 > > [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. > > Going offline until atleast one of them comes back up. > > [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: > > getaddrinfo failed (Name or service not known) > > [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] > > remote1: DNS resolution failed on host n1 > > [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. > > Going offline until atleast one of them comes back up. > > [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: > > getaddrinfo failed (Name or service not known) > > [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] > > remote2: DNS resolution failed on host n4 > > [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. > > Going offline until atleast one of them comes back up. > > [2009-06-11 23:52:54] E [common-utils.c:102:gf_resolve_ip6] resolver: > > getaddrinfo failed (Name or service not known) > > [2009-06-11 23:52:54] E [name.c:242:af_inet_client_get_remote_sockaddr] > > remote2: DNS resolution failed on host n4 > > [2009-06-11 23:52:54] E [afr.c:2223:notify] replicate: All subvolumes are down. > > Going offline until atleast one of them comes back up. > > [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: > > getaddrinfo failed (Name or service not known) > > [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] > > remote1: DNS resolution failed on host n1 > > [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. > > Going offline until atleast one of them comes back up. > > [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: > > getaddrinfo failed (Name or service not known) > > [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] > > remote1: DNS resolution failed on host n1 > > [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. > > Going offline until atleast one of them comes back up. > > [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: > > getaddrinfo failed (Name or service not known) > > [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] > > remote2: DNS resolution failed on host n4 > > [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. > > Going offline until atleast one of them comes back up. > > [2009-06-11 23:53:04] E [common-utils.c:102:gf_resolve_ip6] resolver: > > getaddrinfo failed (Name or service not known) > > [2009-06-11 23:53:04] E [name.c:242:af_inet_client_get_remote_sockaddr] > > remote2: DNS resolution failed on host n4 > > [2009-06-11 23:53:04] E [afr.c:2223:notify] replicate: All subvolumes are down. > > Going offline until atleast one of them comes back up. > > > > > > > > > > It would be better if we could print the error logs only once. > > The log messages are printed while trying to re-connect. we can suppress the > logs to make it print at one place, either at gf_resolve or > af_inet_client_get_remote_sockaddr As of 2.0.6rc4, the message below gets repeatedly printed [2009-08-10 23:56:54] E [common-utils.c:102:gf_resolve_ip6] resolver: getaddrinfo failed (Name or service not known) [2009-08-10 23:56:54] E [name.c:242:af_inet_client_get_remote_sockaddr] remote1: DNS resolution failed on host localhost1
need to verify if it is already fixed in the log cleanup
This is still seen with 3.0.5rc1
*** Bug 867 has been marked as a duplicate of this bug. ***
PATCH: http://patches.gluster.com/patch/3242 in master (Adding GF_LOG_OCCASIONALLY to prevent repeated log messages)
PATCH: http://patches.gluster.com/patch/3250 in release-3.0 (Adding GF_LOG_OCCASIONALLY to prevent repeated log messages)