Bug 1723750

Summary: Some tests failing in the regression regularly, causing spurious failures, and slowing down development period.
Product: [Community] GlusterFS
Reporter: Amar Tumballi <atumball>
Component: core
Assignee: bugs <bugs>
Status: CLOSED NOTABUG
QA Contact:
Severity: urgent
Docs Contact:
Priority: high
Version: mainline
CC: amukherj, bugs, dkhandel, hgowtham, jahernan, moagrawa, pasik, pkarampu, rkothiya, sunkumar
Target Milestone: ---
Keywords: Tracking
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2020-02-10 18:02:16 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Bug Depends On:
Bug Blocks: 1723673

Description Amar Tumballi 2019-06-25 09:46:46 UTC
Description of problem:

Some tests which are failing regularly are:

* ./tests/basic/afr/arbiter-mount.t (if NFS is not available).
* ./tests/basic/afr/tarissue.t - randomly
* ./tests/bugs/gfapi/bug-1319374-THIS-crash.t - fails in distributed, and LCOV tests.

There are more but the above are more common suspects:

./tests/bugs/snapshot/bug-1482023-snpashot-issue-with-other-processes-accessing-mounted-path.t
./tests/bugs/gfapi/bug-1319374-THIS-crash.t
./tests/basic/all_squash.t
./tests/basic/ec/nfs.t
./tests/00-geo-rep/00-georep-verify-setup.t
./tests/basic/quota-rename.t
./tests/basic/volume-snapshot-clone.t


Version-Release number of selected component (if applicable):
master

How reproducible:
>50%

Steps to Reproduce:
1. Send a patch and run the regression tests.


Actual results:
One of these tests would fail.

Expected results:
No tests should fail.

Additional info:
We should have test cases which are specific and which run as predictably as possible. Mark the unstable tests as bad and debug them separately, without blocking development of the master branch.

Blocking other patches from getting in, we would be slowing down a lot in these tests.
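
The idea of marking tests as bad and debugging them separately can be sketched roughly as follows. This is only an illustration and not how the GlusterFS regression framework is implemented: the names Result, KNOWN_ISSUES, blocking_failures and quarantined_failures are hypothetical, and the two quarantined paths are taken from the list above purely as examples.

```python
# Hypothetical sketch: none of these names come from the GlusterFS tree.
from dataclasses import dataclass

# Tests treated as known-unstable (example paths from this report).
KNOWN_ISSUES = frozenset({
    "./tests/basic/afr/tarissue.t",
    "./tests/bugs/gfapi/bug-1319374-THIS-crash.t",
})


@dataclass(frozen=True)
class Result:
    path: str     # test script path
    passed: bool  # True if the test succeeded


def blocking_failures(results, known_issues=KNOWN_ISSUES):
    """Return paths of failed tests that should block a patch.

    Failures of tests listed in known_issues are excluded, so they
    do not count against the run.
    """
    return [r.path for r in results
            if not r.passed and r.path not in known_issues]


def quarantined_failures(results, known_issues=KNOWN_ISSUES):
    """Return paths of failed tests that are quarantined and still need debugging."""
    return [r.path for r in results
            if not r.passed and r.path in known_issues]
```

With this split, a run is rejected only when blocking_failures() is non-empty, while quarantined_failures() gives the list of tests to debug separately.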

Comment 1 Amar Tumballi 2019-06-25 09:49:25 UTC
> Blocking other patches from getting in, we would be slowing down a lot in these tests.

Correction: We would be slowing down the project's development if we have unstable tests.

Comment 2 Worker Ant 2019-06-25 10:00:46 UTC
REVIEW: https://review.gluster.org/22940 (tests: mark some tests as KNOWN_ISSUE) posted (#1) for review on master by Amar Tumballi

Comment 3 Deepshikha khandelwal 2019-06-25 13:05:16 UTC
./tests/basic/distribute/non-root-unlink-stale-linkto.t is also failing frequently when creating user.

Comment 4 Nithya Balachandran 2019-06-25 13:19:02 UTC
(In reply to Deepshikha khandelwal from comment #3)
> ./tests/basic/distribute/non-root-unlink-stale-linkto.t is also failing
> frequently when creating user.

The test is not unstable. Gluster has no control over useradd. We need to figure out what is going wrong on the regression machines.

Comment 5 Amar Tumballi 2019-06-26 07:17:28 UTC
./tests/bugs/rpc/bug-954057.t needed a retry.

Comment 6 Worker Ant 2019-06-26 12:08:15 UTC
REVIEW: https://review.gluster.org/22949 (test: Fix spurious failure in bug-1482023-snpashot-issue-with-other-processes-accessing-mounted-path.t) posted (#1) for review on master by MOHIT AGRAWAL

Comment 7 Sunny Kumar 2020-02-10 18:02:16 UTC
Closing this bug as its purpose has been served.