Bug 809194 - 404 Response Codes from Internally Linked URLs
Summary: 404 Response Codes from Internally Linked URLs
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: OKD
Classification: Red Hat
Component: Website
Version: 2.x
Hardware: Unspecified
OS: All
low
low
Target Milestone: ---
: ---
Assignee: Fabiano Franz
QA Contact: libra bugs
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2012-04-02 17:41 UTC by Shawn Purtell
Modified: 2015-05-15 01:07 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2012-04-27 20:45:54 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
List of URLs responding with a 404 code (3.86 KB, text/csv)
2012-04-02 17:41 UTC, Shawn Purtell
no flags Details

Description Shawn Purtell 2012-04-02 17:41:47 UTC
Created attachment 574576 [details]
List of URLs responding with a 404 code

Description of problem:
Several (currently 33) internal links on the OS community site are returning 404 (file not found) response codes when accessed.

Version-Release number of selected component (if applicable):
N/A

How reproducible:
Visit any of the 33 URLs in the attached .csv file.

Steps to Reproduce:
1. Visit any page on the list of URLs (e.g. https://www.redhat.com/openshift/community/documents)
  
Actual results:
404 response code

Expected results:
Active page on https://www.redhat.com/openshift/community/

Additional info:
This data was accessed via a Google Webmaster Tools crawl report.
Issues affect both SEO and user experience (broken links).

Comment 1 David O'Brien 2012-04-12 02:20:06 UTC
Any links to (e.g.), "OpenShift_Express-*" need to be replaced with the latest versions, already published on the Community site, that no longer use "Express" as part of the name.

Comment 2 Fabiano Franz 2012-04-12 17:45:04 UTC
Lowering the severity since this is not related to source code so this don't block stg build. Drupal content changes right in production.

Comment 3 Fabiano Franz 2012-04-12 17:51:33 UTC
This is only a bug if the related URLs are linked *from* any of our pages. Please note that GWT reports every 404 *including* the URLs linked only from external sites that links to us. 

For some of the most important ones we can create redirects in order to keep important external articles or blog posts working. But for most of them, it would be nice to know which of our own pages links to those that are returning 404. Is that kind of information available?

Thanks!

Comment 4 Shawn Purtell 2012-04-12 18:20:17 UTC
Internal link information is available within the GWT UI for each of these URLs, but is not easily exported. I can give you direct access to the data via a Google account if that works.

Comment 5 Fabiano Franz 2012-04-12 18:23:18 UTC
(In reply to comment #4)
> Internal link information is available within the GWT UI for each of these
> URLs, but is not easily exported. I can give you direct access to the data via
> a Google account if that works.

It does, my google account is: fabianofranz.

Comment 6 Shawn Purtell 2012-04-12 18:36:25 UTC
(In reply to comment #5)
> (In reply to comment #4)
> > Internal link information is available within the GWT UI for each of these
> > URLs, but is not easily exported. I can give you direct access to the data via
> > a Google account if that works.
> 
> It does, my google account is: fabianofranz.

Great - you now have access via http://www.google.com/webmastertools.

Information is under Diagnostics >> Crawl Errors. Click on a specific URL to access internal link information on the 'Linked From' tab.

Comment 7 Fabiano Franz 2012-04-12 19:17:11 UTC
The most important 404s has been fixed with URL redirects right in production. Examples:

 * /documents to /documentation
 * /forums/express to /forums/openshift
 * all guides and documents
 * broken FAQs
 * broken feature requests
 * many others

We can expect them to be removed from GWT Crawl Errors in some time. The ones related to forums threads has not been fixed when we understand the threads has been closed (not Express to OpenShift renaming problems), in that case a 404 is actually expected.

Thanks!

Comment 8 Yujie Zhang 2012-04-16 10:11:22 UTC
(In reply to comment #7)
According to your comments, tested this bug on product, it has been fixed now, thanks.


Note You need to log in before you can comment on or make changes to this bug.