How to Map 404 URLs at Scale with Sentence Embeddings via @hamletbatista

Subscribers:
4,200
Published on ● Video Link: https://www.youtube.com/watch?v=Vzkl2zYNGQ8



Category:
Tutorial
Duration: 4:01
11 views
0


Reported today on Search Engine Journal

For the full article visit: http://tracking.feedpress.it/link/13962/13043927

How to Map 404 URLs at Scale with Sentence Embeddings

One surefire way to help clients gain more SEO traffic is to redirect valuable URLs that end up in 404s to equivalent ones.

These URLs generally still get traffic, have valuable external links coming in or both.

One lazy and ineffective approach to map 404 URLs is to redirect all of them to the home page or a dynamic search result. 🤦

For example, Smarthome 302 redirects non-existing pages to its home page. This type of redirection is generally flagged as soft 404 errors in Google Search Console.

The correct approach is to map each one individually to an equivalent page if such a page exists.

However, this process can be very tedious, time-consuming, and expensive if you need to do it manually.

Oftentimes, you need to rely on the default internal search engine of the site, which is rarely any good.

In this column, we will learn how to automate this valuable technique using a neural matching approach.

Here is our plan of action:

Downloading URL Sets

There are many ways to get 404 URLs. You could run a website crawl, download 404s from Google or Bing Search Consoles, etc.

One of my favorite places to get 404 URLs, is the Ahrefs Broken Backlinks tool because it filters 404s to pages with external links.

Google Search Console will likely have far more 404s to map, though. If you rather map all 404s and have more than one thousand to download, you might want to consider using our Cloudflare app which has no such limits.

You can export up to 100,000 URLs or as many as you have when you connect it to Google Drive.

Next, you need a set of all valid website URLs, preferably canonical URLs.

One simple way to get such a list is to download the XML sitemap URLs.

If your client doesn't have XML sitemaps, you can perform a tradi




Other Videos By Colin Boyd SEO


2019-12-06FTC officially rules that Cambridge Analytica deceived Facebook users
2019-12-06This AI text adventure game has pretty much infinite possibilities
2019-12-06Carbon markets could strengthen or screw over global action on climate change
2019-12-06Bernie Sanders aims to break up ISP and cable monopolies
2019-12-06Twitch’s Dr Disrespect is developing a TV series based on his streaming persona
2019-12-06Windows on ARM gets two new Qualcomm chips for budget laptops
2019-12-06Bernie Sanders unveils $150 billion plan to expand high-speed internet access
2019-12-06Google Assistant adds topical podcast search and photo sharing via voice
2019-12-06Facebook sues Hong Kong company that used "celeb baiting" to hijack accounts
2019-12-06Marriage Story brings rom-com energy to the agony of divorce
2019-12-06How to Map 404 URLs at Scale with Sentence Embeddings via @hamletbatista
2019-12-06Making weird go viral with Hi Stranger creator Kirsten Lepore
2019-12-06Rocket Lab tests key maneuver needed for reusability during 10th flight to space
2019-12-06Qualcomm's new Snapdragon XR2 is a 5G-compatible chip for mixed reality headsets
2019-12-06From lava lamps to moon balls: the kids’ Christmas gift guide
2019-12-06Cashmere, cologne and hair curlers: your essential fashion, beauty and grooming gift guide
2019-12-06The PlayStation 4’s Share button changed the way we play together
2019-12-05Scammers peddling Islamophobic clickbait is business as usual at Facebook
2019-12-05FCC won’t punish Verizon and T-Mobile for exaggerating their coverage maps
2019-12-05OnePlus built a piano out of 17 phones because why not
2019-12-05Review: Driving the track-ready, race-banned McLaren Senna GTR