1469259 – Async Hypervisor Checkin payload should be checked on submission

Bug 1469259 - Async Hypervisor Checkin payload should be checked on submission

Summary: Async Hypervisor Checkin payload should be checked on submission

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	Candlepin
Classification:	Community
Component:	candlepin
Sub Component:
Version:	2.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	low
Target Milestone:	---
Target Release:	---
Assignee:	Abhishek Kumar
QA Contact:	Katello QA List
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2017-07-10 18:27 UTC by Shayne Riley
Modified:	2020-02-19 16:37 UTC (History)
CC List:	6 users (show)
Fixed In Version:	candlepin-3.1.4-1
Clone Of:
Environment:
Last Closed:	2020-02-19 16:37:55 UTC
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Github	/candlepin candlepin pull 2590	0	None	None	None	2020-02-20 15:14:55 UTC

Description Shayne Riley 2017-07-10 18:27:46 UTC

Description of problem:
It is possible for a client to submit an async hypervisor checkin and get a successful 200 response back without ever realizing that their payload is incorrect, which would normally return a 400 for other requests.

BUT, if the client is prudent enough to use the job id from the response and retrieves said job, it is only then that the client knows that the job failed, but that's it.

Additionally, even if the client was able to look at the logs and find the stack trace for it (which it can't), the stack trace doesn't make any mention of the fact that the payload wasn't formatted right. Instead the stack trace is for a Job Excecution Exception "Caused by: java.lang.NullPointerException: null
at org.candlepin.pinsetter.tasks.HypervisorUpdateJob.toExecute(HypervisorUpdateJob.java:214)"

If one were to look up that line (214) in source it reads:
log.debug("Hypervisor consumers for create/update: {}", hypervisors.getHypervisors().size());

But the crazy part is in the line JUST BEFORE that (line 213):
HypervisorList hypervisors = (HypervisorList) Util.fromJson(json, HypervisorList.class);
...which indicates that Util.fromJson(String, Class) EATS the exception that Jackson throws, and returns null instead (and looking at that source, the analysis appears to be correct).

To sum up the issue in its entirety:
- A client can submit an async hypervisor checkin with 1 or more random characters of text, as long as the owner is real, and gets back a 200 success, even though the job is doomed to fail.
- No sanitization of the client input takes place before it is stored into the DB. Rather, there is only an insufficient check for null or empty text, at which point the text is compressed via Deflate and then stored.
- Once the job starts, the text is decompressed and then deserialized via Util.fromJson method, which eats an exception and returns null.
- HypervisorUpdateJob.toExecute doesn't check to see if the resulting HyervisorList hypervisors variable is null, NOR does it check to see if hypervisors.getHypervisors() is null (which can happen if the customer sends an empty json object for a payload: {}
- An NPE is thrown, and the job state becomes "FAILED"
- The client checks the job, and finds out that the job is failed, but has no idea why.

Version-Release number of selected component (if applicable):
2.0.37

How reproducible:
Always

Steps to Reproduce:

1. Submit any non-JSON formatted text to "Content-Type: text/plain" POST $CPHOST/hypervisors/$owner ... make sure the owner exists, though.

OR, Submit this JSON formatted text: {}
..it'll have the same effect.

2. Use the job id from the response and look up the job: GET $CPHOST/jobs/$jobId

Actual results:
After Step 1, a 200 response is returned.
After Step 2, the state of the job is "FAILED" with no explanation why (Hint: it was the client's fault).

Expected results:
The payload is checked BEFORE getting stored in the DB for correctness and a 400-series status is returned with a sufficient explaination why it was no good.

Note You need to log in before you can comment on or make changes to this bug.