Bug 1489145 - [RFE][Tracker] Provide tools to analyze snapshot issues and cross check database and storage metadata.
Keywords:
Status: CLOSED DUPLICATE of bug 1739106
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: distribution
Version: 4.1.5
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: high
Target Milestone: ovirt-4.4.0
Target Release: 4.4.0
Assignee: Benny Zlotnik
QA Contact: Avihai
URL:
Whiteboard:
Duplicates: 1580130
Depends On: 1557147 1678969 1739106
Blocks: 902971
Reported: 2017-09-06 19:09 UTC by Marina Kalinin
Modified: 2021-06-10 12:57 UTC (History)
CC List: 14 users

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-02-20 20:23:04 UTC
oVirt Team: Storage
Target Upstream Version:
Embargoed:
lsvaty: testing_plan_complete-


Links
System ID Private Priority Status Summary Last Updated
Red Hat Bugzilla 1469683 0 medium CLOSED [BLOCKED][RFE] Provide tool to modify volume metadata values 2021-06-10 12:37:47 UTC
Red Hat Knowledge Base (Article) 3740351 0 None None None 2019-10-29 04:48:49 UTC
oVirt gerrit 87822 0 'None' MERGED dump_volume_chains: add volume sizes 2020-09-25 12:57:58 UTC
oVirt gerrit 88059 0 'None' ABANDONED sos: include storage domains volume metadata 2020-09-25 12:57:58 UTC
oVirt gerrit 88108 0 'None' MERGED dump_volume_chains: improve error text 2020-09-25 12:57:58 UTC
oVirt gerrit 88109 0 'None' ABANDONED contrib: add tool to visualize and search metadata 2020-09-25 12:57:58 UTC
oVirt gerrit 89098 0 'None' MERGED dump-volume-chains: Separate chain analysis 2020-09-25 12:57:58 UTC
oVirt gerrit 89099 0 'None' MERGED dump-volume-chains: dump metadata in json format 2020-09-25 12:57:58 UTC
oVirt gerrit 89149 0 'None' ABANDONED sos: do not analyze chains in dump-volume-chains 2020-09-25 12:57:58 UTC
oVirt gerrit 89150 0 'None' MERGED dump-volume-chains: dump metadata in sql format 2020-09-25 12:58:02 UTC
oVirt gerrit 89272 0 'None' ABANDONED dump-volume-chains: optionally output to a file 2020-09-25 12:58:02 UTC
oVirt gerrit 90527 0 'None' ABANDONED sos: run dump-volume-chains only on spm 2020-09-25 12:57:58 UTC
oVirt gerrit 91205 0 'None' ABANDONED dump-volume-chains: accept multiple outputs 2020-09-25 12:57:57 UTC
oVirt gerrit 91906 0 'None' MERGED sos: introduce dump-volume-chains option 2020-09-25 12:58:02 UTC
oVirt gerrit 91979 0 'None' MERGED sosreport: dump-volume-chains in one host per pool 2020-09-25 12:57:57 UTC
oVirt gerrit 92153 0 'None' MERGED sos: only dump volume chains from data domains 2020-09-25 12:58:02 UTC
oVirt gerrit 95257 0 'None' ABANDONED tool: add dump-qemu-chain command 2020-09-25 12:57:57 UTC
oVirt gerrit 95258 0 'None' ABANDONED tool: add set-image-chain command 2020-09-25 12:57:57 UTC
oVirt gerrit 96053 0 'None' MERGED dump-volume-chains: dump volume type in sqlite 2020-09-25 12:57:57 UTC
oVirt gerrit 96086 0 'None' MERGED dump-volume-chains: dump disk type in sqlite 2020-09-25 12:57:57 UTC
oVirt gerrit 96154 0 'None' ABANDONED WIP: contrib: add cross_check_sd script 2020-09-25 12:58:01 UTC
oVirt gerrit 96213 0 'None' ABANDONED sos: dump volume chains in sqlite format 2020-09-25 12:58:01 UTC
oVirt gerrit 96214 0 'None' ABANDONED capture dump volume chains in sqlite format 2020-09-25 12:57:57 UTC
oVirt gerrit 96377 0 'None' ABANDONED dump-volume-chains: dump all data sd in sqlite 2020-09-25 12:57:56 UTC
oVirt gerrit 103380 0 'None' ABANDONED tools: add a discrepancy checker tool 2020-09-25 12:58:03 UTC

Internal Links: 1469683

Description Marina Kalinin 2017-09-06 19:09:19 UTC
We need a tool that analyzes VM snapshots and provides an easy, comprehensive way to fix broken snapshots.

Ideally, it will be a UI tool, accessible from the Admin Portal. It will provide a visual view of the current snapshot state as it is expected in the database and how it is actually represented on the storage.

The tool will analyze the snapshot health of a given VM or VM disk.
It will show any problems visually, so that they are easy to understand.
It will also suggest an automated fix for the problem(s) found.

We can start with a command line tool and move to a UI one later on. 

The main reason for this RFE is that analyzing snapshots is very time-consuming, and automating it would save many hours of work.

Comment 4 Marina Kalinin 2018-02-16 20:26:06 UTC
Reopening, since I disagree with this decision.
This tool is definitely something we need in the long run to become more time-efficient at solving snapshot and similar issues.
Just now CEE filed a new bug with a similar request: bz#1545251.


However, we may consider implementing such a tool inside CEE.
This is how I envision it:

Step 1: direct the user to provide all the required information for understanding the full picture (database, lvm output, metadata, qemu-img check, etc).

Step 2: build a visual picture of each disk chain, both from database and from the storage. Does not need to be fancy, but should show the chains in a clear way and separate each disk from another. Ideally, visualize each snapshot as well.

Step 3: print the commands to run on customer environment to fix the problem.

Step 4: [dream] integrate this tool with RHV UI and provide an option to fix the problem with one or two clicks.
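
As an illustration of step 2 (not taken from any tool referenced in this bug), here is a minimal Python sketch of cross-checking one disk's volume chain as recorded in the database against the chain found on storage. The input shape (a mapping of volume ID to parent ID) and the names `build_chain` and `cross_check` are assumptions made for this example; real data would first have to be extracted from the engine database and the storage domain metadata.

```python
from typing import Dict, List, Optional

# Hypothetical input shape: volume ID -> parent volume ID (None for the base volume).
ParentMap = Dict[str, Optional[str]]


def build_chain(parents: ParentMap) -> List[str]:
    """Return the volume IDs ordered from base to leaf, or raise ValueError."""
    bases = [vol for vol, parent in parents.items() if parent is None]
    if len(bases) != 1:
        raise ValueError(f"expected exactly one base volume, found {len(bases)}")
    # Invert the parent links so the chain can be walked from base to leaf.
    children: Dict[str, str] = {}
    for vol, parent in parents.items():
        if parent is None:
            continue
        if parent in children:
            raise ValueError(f"volume {parent} has more than one child")
        children[parent] = vol
    chain = [bases[0]]
    while chain[-1] in children:
        chain.append(children[chain[-1]])
    if len(chain) != len(parents):
        raise ValueError("some volumes are not reachable from the base")
    return chain


def cross_check(db: ParentMap, storage: ParentMap) -> List[str]:
    """Return human-readable discrepancies between the two views of one disk."""
    problems = []
    for vol in sorted(db.keys() - storage.keys()):
        problems.append(f"{vol}: in database, missing on storage")
    for vol in sorted(storage.keys() - db.keys()):
        problems.append(f"{vol}: on storage, missing in database")
    for vol in sorted(db.keys() & storage.keys()):
        if db[vol] != storage[vol]:
            problems.append(
                f"{vol}: parent is {db[vol]} in database but {storage[vol]} on storage"
            )
    return problems


if __name__ == "__main__":
    # Made-up example: snap1 exists in the database view but not on storage.
    db_view = {"base": None, "snap1": "base", "snap2": "snap1"}
    storage_view = {"base": None, "snap2": "base"}
    print("database chain:", " <- ".join(build_chain(db_view)))
    print("storage chain: ", " <- ".join(build_chain(storage_view)))
    for line in cross_check(db_view, storage_view):
        print("PROBLEM:", line)
```

Keeping each disk as its own parent map keeps the chains separate, as step 2 asks; printing the fix commands of step 3 would need knowledge of the actual environment and is deliberately left out of the sketch.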

Comment 5 Marina Kalinin 2018-02-16 20:26:40 UTC
Gordon, can you please help define step 1?

Comment 6 Yaniv Kaul 2018-02-16 20:36:54 UTC
Please move to CEE if you are interested in implementing this.

Integration of the scripts/tool into RHV is of course encouraged; it is especially suited to closed environments.
Integration into the UI: I would not bother, but could it be done via a UI plugin?

Comment 10 Yaniv Lavi 2018-06-11 08:21:13 UTC
Do you want to backport this to 4.2.z?

Comment 11 Germano Veit Michel 2018-06-11 22:30:23 UTC
(In reply to Yaniv Lavi from comment #10)
> Do you want to backport this to 4.2.z?

Not yet, maybe once we finish the data collection patches.

Thanks

Comment 12 Marina Kalinin 2018-07-23 15:01:12 UTC
*** Bug 1580130 has been marked as a duplicate of this bug. ***

Comment 13 Marina Kalinin 2018-07-23 15:47:07 UTC
Germano, based on your comment here:
https://bugzilla.redhat.com/show_bug.cgi?id=1580130#c5
Shall we change the title of the bug to be more inclusive?

Comment 14 Germano Veit Michel 2018-07-23 22:53:18 UTC
(In reply to Marina from comment #13)
> Germano, based on your comment here:
> https://bugzilla.redhat.com/show_bug.cgi?id=1580130#c5
> Shall we change the title of the bug to be more inclusive?

Looks better? I also added a BZ that will provide another tool to help fix these issues.

Comment 15 Sandro Bonazzola 2019-01-28 09:36:34 UTC
This bug has not been marked as blocker for oVirt 4.3.0.
Since we are releasing it tomorrow, January 29th, this bug has been re-targeted to 4.3.1.

Comment 17 Avihai 2019-05-17 13:46:25 UTC
Please retarget this bug to 4.3.5/4.4, as it depends on Bug 1678969 (in NEW state and targeted to 4.3.5) and Bug 1557147 (targeted to 4.4).

Comment 22 Germano Veit Michel 2019-10-29 04:28:06 UTC
Benny is working on it, in https://gerrit.ovirt.org/#/c/103380/

Comment 24 Marina Kalinin 2020-02-20 20:23:04 UTC

*** This bug has been marked as a duplicate of bug 1739106 ***

