Friday, May 11, 2012

How To Survive Google's Unnatural Links Warnings & Avoid Over-optimisation

Google’s actions against “overoptimised” sites have been intensified and it seems that the recent updates are able to detect various links-related overoptimisation violations that can result in short or long-term positional drops, with or without warnings. Even though Google announced the Penguin update on the 24th if April 2012, changes on Google’s link evaluation system have been noticeable several weeks before the public announcement.

It appears that in some cases Google has been overzealous, hitting (temporarily) even websites with rather natural link profiles. In other cases, Google admitted algorithmic classification mistakes, publishing apologetic messages like this one Matt Cutts posted on Google+.

According to Patrick Altoft (branded3) the sites that have received unnatural links notifications fall under five main categories, and they are not just the ones participating in link exchanges or other types of link networks. Rand made a Whiteboard Friday video about 6 changes every SEO should make before the overoptimisation penalty hits. Now that Google has rolled out the Penguin update against "black web spam," making sure that your website's backlink profile does not violate Google's quality guidelines is quite essential, especially if your traffic has gone down.

This is a follow up to the post ‘How to Monitor Your Website For Link Equity Loss’, which can be used to identify backlinks from low quality or penalised/deindexed websites. However, this post intends to cover the following links related overoptimisation cases:

A) Excessive Link Acquisition

B) Site-wide links detection

C) Unnatural Anchor Text Distribution

D) Unnatural Spread of Links Authority

A. Excessive Link Acquisition Check

Acquiring a high number of links over a short period of time has never been a good practice and webmasters need to keep an eye on the levels of acquired links, especially these days that negative SEO seems to become more of an issue. Phrases like the following one from Dan Thies, seem to be heard more often lately:

"Both sites have received “unnatural links” messages in Webmaster Tools. Neither site has had a “link building” campaign ever. By using 3rd party tools (e.g. Majestic) I can see a lot of unnatural links pointing at both sites, but I didn’t put those links there"

There are two quick ways to check your site's link acquisition velocity, using Ahrefs or Majestic SEO.

Extremely  High Link Acquisition Velocity (Ahrefs) 

Unnatural Link Acquisition Velocity (Majestic SEO historic index, cumulative view)

B. How To Check For Site-wide Links

A high number of blog-roll, header, footer or sidebar links can trigger Google’s “overoptimisation” wrath and keeping them to a minimum would be a rather reasonable thing to do. Certainly some site-wide links may have occurred naturally but the less "overoptimisation" signals you site sends out, the better. There are a few ways to quickly check your website against site-wide links with the quickest one being Webmaster Tools.

Under 'Your site on the web' -> Links to your site WMT list the domains that link the most to your site. The ones that link several times should be flagged as potential site-wide links and be manually checked.

Using Webmaster Tools to detect site-wide links is a rather easy and quick way. However, because WMT don't report all backlinks Google actually see, for a more thorough investigation a third party link intelligence service should be used such as Majestic SEO, Open Site Explorer, Ahrefs etc. One thing to bear in mind using any 3d party service, is that their crawlers do not try to replicate Google's behaviour, therefore in some cases the data can be significantly skewed. This is particularly the case for links from web sites that Google has removed from its index but the 3d party services will still report as normal links. 

A section in the Ahrefs FAQ page reads: “Having the full information at hand you may decide on subjective estimates of links and figure out real situation with the given website or page”. In a similar manner a Majestic SEO rep commented that:

 "Our index is independent of Google and will remain so. If Google has banned a site, it does not mean there are no longer links to that site. We map the link graph – not Google’s interpretation of it."

C. How To Check For Unnatural Anchor Text Distribution

Overoptimised anchor text seems like a ticking bomb, especially after Google made public the following two messages:

"Tweaks to handling of anchor text. [launch codename "PC"] This month we turned off a classifier related to anchor text (the visible text appearing in links). Our experimental data suggested that other methods of anchor processing had greater success, so turning off this component made our scoring cleaner and more robust."

"Better interpretation and use of anchor text. We’ve improved systems we use to interpret and use anchor text, and determine how relevant a given anchor might be for a given query and website."

This task is quite more complicated because, ideally, you need as much data as possible. Exporting anchor text data from as many different data sources as possible is strongly recommended e.g. Majestic SEO, Ahrefs, Open Site Explorer, Sistrix, Blekko. 

Next you would need to filter the data removing the following:

Dead links - These are sites that no longer link to your site but used to link in the past. Filtering out the dead links is absolutely necessary and some 3d party services offer such tools. Otherwise, proprietary link checkers can be used like the one we use at iCrossing UK - Alex Ovsianikov's creation. Counting dead links into a backlinks audit can result in wrong conclusions.Deindexed linking root domains - It's rather pointless carrying out a backlinks audit for Google including links from sites that Google has deindexed. NetPeak Checker, makes this task quite easy as it is explained on this post.No follow links - These are unlikely to cause any overoptimisation issues and could be discountedSite-wide links - These should be counted once, otherwise the anchor text distribution will be greatly skewed. Different services treat site-wides differently; hence you need to pay extra attention at how each service treats them.

After having applied the above filters, the remaining backlinks data could be analysed for different anchor text types such as:

Exact match targeted keywords e.g. hr softwareBroad match keywords e.g. online hr software systemBrand terms e.g. BreatheHR, www.breatheHR.comKeyword + brand terms e.g. Breathe HR software systemImage linksOther e.g. 'click here', 'this site' and other natural anchor text that doesn't fall under any of the above categories.

Having classified all different anchor text variations, it is now relatively easy to spot weaknesses - pay particular attention for spikes on exact match keywords as in the following graph:

Another area where overoptimisation can occur is when backlinks are consistently gained from authoritative domains. Tom Anthony created a handy link profile tool to detect such anomalies.In the following example the links authority of the site represented by the blue line seems quite unnatural compared to the backlinks authority spread of the other 4 sites, which seems far more natural. Any high spikes towards the middle of the graph could potentially be flagged by Google as suspicious attempts of PageRank maninpulation. Proactively carrying out the above checks will help identify weaknesses on a site's backlink profile and be better prepared for Google's current and forthcoming "overoptimisation" updates.As with any new algorithmic update there will be winners and losers. Google have acknowledged that and there is a feedback form for web masters who feel that their site should not have been affected by the Penguin update.If all the above fails, then try this petition where site owners are urging Google to kill the penguin update.

About the author
Modesto Siotos (@macmodi) works as a Senior Natural Search Analyst for iCrossing UK, where he focuses on technical SEO issues, link tactics and content strategy. Modesto is happy to share his experiences with others and posts regularly on digital marketing blog Connect.


View the original article here

Penguins, Pandas, and Panic at the Zoo

Google’s war on lovable critters escalated on April 24th with the release of the “Penguin” update (originally dubbed the “webspam update” by Google). While every major algorithm update causes some protest, post-Penguin panic seems to be at near record levels, worsened by weeks of speculation about an “over-optimization” penalty. Webmasters and SEOs are understandably worried, and many have legitimately lost traffic and revenue. Before you go out and burn your website to the ground for fear of a penguin in the pantry, I want to offer some advice on how to handle life after an algorithm update.

1. What We Know

First, let’s review what we know. I’m going to break the rules of blogging and recommend that you stop and read this level-headed Penguin post by Danny Sullivan. It covers some of the basics and is the most speculation-free post I’ve read on the subject so far. Glenn Gabe also had a good post on potential Penguin factors.  There’s still a lot of speculation, but likely culprits include:

Aggressive exact-match anchor textOveruse of exact-match domainsLow-quality article marketing & blog spamKeyword stuffing in internal/outbound links

Many people have suggested low-quality link profiles in general, but analysis of Panda has been complicated by Google’s recent attack on link networks, which seems to have been manual and has probably been going on for weeks. The overlap has made analysis difficult, so let’s take a quick look at the timeline.

The official roll-out date for Penguin was April 24th, and it seems to have rolled out, for the most part, in a single day. Unfortunately, it came on the heels of other events. On April 19th, Panda 3.5 rolled out (most likely a data update). On April 16th, a data glitch caused a number of sites to be mistakenly tagged as parked domains. Throughout April (and weeks before Penguin), Google started sending out a large number of unnatural link notices via Google Webmaster Tools. Sadly, it seems that April really was the cruelest month.

Google officially claimed that Penguin impacted about 3.1% of English queries, compared to Panda 1.0’s 12%. Since rankings change daily – even hourly – even with no updates, these numbers are nearly impossible to confirm, but it does appear that the impact of Penguin was immediate and substantial. This is an internal SEOmoz graph of Top 10 ranking changes around April 24th (please note that the Y-axis is scaled to accentuate changes):

Graph of Top 10 changes (Penguin vs. Panda 3.5)

Pardon the slightly cryptic nature of this graph – it’s for an upcoming project – but the core point is that the impact of Penguin dwarfed either Panda 3.5 or Google’s 4/16 glitch.

In a word: no. Penguin wasn’t accidental, and Google is clearly serious about combatting spam tactics that have been lingering for too long. As you can see from the graph, it doesn’t appear that there were any major reversals in the few days since Penguin rolled out. Does that mean Google won’t make ANY adjustments? Of course not – it’s entirely likely that they’ll continue to tweak Penguin.

For comparison’s sake, remember that Panda 3.5 came 14 months after the initial launch of Panda 1.0. We’ve come a long way since the monthly “Google Dances” of 2003. Keep in mind, though, that Panda was somewhat unique – we believe that it feeds multiple variables into a single ranking factor that gets updated outside of the real-time index. There’s currently no compelling evidence to suggest that Penguin works in the same way. The Penguin update appears to be integrated directly into the main algorithm, like a more traditional Google update.

Update (May 11, 2012): Google seems to have confirmed that Penguin is operating similarly to Panda, with data updates occuring outside of the main index. It appears that my initial assumptions on that were incorrect. Danny Sullivan has a good follow-up piece on Penguin today, with some direct feedback from Google. It's not good news, for the most part, but it's a very useful read.

Given the overlapping timelines, this advice applies to any Google update, and not just Penguin. The algorithm is changing constantly (Google reported 516 changes in 2010, and that rate seems to be accelerating), and I want to give you the tools to survive not just Penguin, but Zebra, Skunk, Orca, and any other black-and-white animals Google can ruin…

I’m not trying to be condescending or to minimize any losses you may have suffered. Over 17 years of working with clients, I’ve learned that panic almost never makes things better. No matter how hard Penguin hit you, you need to stop, take a breath, and assess the damage. Dig into your analytics and find out exactly where you sustained losses. Segment your data (by channel, engine, keyword, and page) as much as possible. It’s not enough to know that you lost traffic – you need to be an expert on exactly which traffic you lost.

Even though the overlapping timelines make analyzing the core Penguin factors difficult, the actual timeline when Penguin rolled out is clear. If you saw major traffic losses between Tuesday, April 24th and Wednesday, April 25th, odds are good that Penguin is at least part of the problem.

I can’t tell you how many times I’ve been involved in a Q&A or consulting situation where a website owner was 100% sure they had been hit by an algorithm update, only to have their 17th message to me go something like this:

Oh, by the way, our site was down for 3 days a couple of weeks ago, right before our rankings dropped. I’m sure this wasn’t the problem, but I just thought I’d let you know.

Um, erp, what?! I’ve died a little inside so many times from messages like this that I’m not sure that I’m technically still human. Especially if your losses weren’t sudden or don’t match the algorithm timeline precisely, make absolutely sure that nothing happened to your site or changed that could impact Google’s crawlers. One of the worst things you can do in SEO is to spend a small fortune solving the wrong problem.

Likewise, make sure that you know exactly what SEO efforts are underway, not just within your own team but across any 3rd-party contractors. I’ve had clients swear up and down that everything they did was completely white-hat only to find out weeks later that they hired an outside link-building firm and let them loose with no accountability. Make absolutely sure you know what every agent under your control did in the weeks leading up to the algorithm update.

Panic leads to drastic action, and while I don’t think you should sit on your hands, bad choices made under uninformed hysteria can make a bad situation much, much worse. I’m not speaking hypothetically – I’ve seen businesses destroyed by overreacting to an algorithm change. Here are a few words of advice, once you’ve taken that deep breath (don’t forget to start breathing again)…

It’s unclear how Penguin may have penalized links, or if recent reports of link-related issues are tied to other April changes, but regardless of the cause, the worst thing you can do is to start simply hacking at your back-links. Even low-quality back-links can, in theory, help you, and if you start cutting links that aren’t causing you problems, you could see your rankings drop even farther.

I highly recommend this recent interview with Jim Boykin, because Jim has freely admitted to dabbling in the gray arts and he knows what he’s talking about when it comes to risky link-building. Tackling your problem links is incredibly tough, but start with the worst culprits:

Known, obvious paid linksLinks in networks Google has recently delistedFooter links with exact-match anchor textOther site-wide links with exact-match anchors

Whenever possible, deal with low-authority links first. If a link is passing very little authority AND it’s suspicious, it’s a no-brainer. Cutting links is tough (see my tips on removing bad links) – if you don’t have control over a link, you may have to let it go and focus on positive link-building going forward.

One complaint I hear a lot in Q&A is that the “wrong” page is ranking for a term. So, to get the “right” page to rank, the well-meaning SEO starts de-optimizing the page that’s currently ranking. This usually means turning a decent TITLE tag into a mess and cutting out keywords to leave behind Swiss-cheese copy. Sometimes, the “right” page starts ranking again. Other times, they lose both pages and their traffic.

“Over-optimize” is a terrible phrase, and that alone has people in a panic. There’s nothing “optimal” about jamming a keyword 87 times into 500 words of copy and linking it to the same affiliate site. “Over-gaming” would be a better word. You think you figured out the rules of the game, so you pounded on them until there was nothing but a pile of dust on the board.

If you think you’ve played the game too aggressively, step back and look at the big picture. Does your content serve a purpose? Does your anchor text match the intent of the target? Do your pages exist because they need to or only to target one more long-tail variations of a term? Don’t de-optimize your on-page SEO – re-optimize it into something better.

While I don’t think reconsideration will doom you, Penguin is an algorithmic change, not a manual penalty, and reconsideration is not an appropriate avenue. If you think you were impacted by the recent crackdown on link networks, IF you have removed those links, and IF you aren’t engaged in other suspicious link-building, you might consider requesting reconsideration. Just make sure your house is in order first.

Google has created a form for sites unfairly hit by Penguin, but it’s unclear at this point whether that form will result in manual action, or if Google is just collecting broad quality data. If you sincerely believe that you’re an accidental victim, then feel free to fill the form out, but don’t base your entire recovery strategy on clicking [Submit].

Fix What You Can Fix

Recently, I had a long debate with a client about whether or not they had been hit by a specific algorithm update. In the end, it was a pointless debate (for both of us), because we had two clear facts: (1) organic traffic had fallen precipitously, and (2) there were clear, solvable problems with the site. From a diagnostic standpoint, it definitely helps to know whether you were hit by Penguin or another update, but after that, you have to fix what's in your power to fix. Don't spend weeks trying to prove to management that this was all Google's fault. Isolate the damage, find the problems you can fix, and get to work fixing them.


View the original article here

Whiteboard+ on Google's Penguin Update

Yesterday, I filmed some quick thoughts on Google's Penguin update. You can find the full video on our Google+ page:

Google Penguin Update Whiteboard+

In it, I cover a few unique items about Penguin:

It's (weirdly) not focused on improving search qualityIt appears to affect some of the worst spam (but not all) and some very light forms of spam/manipulation (oddly)Not tied to on-page or on-site necessarily, though outlinks may be looked at and several other updates occurred at similar times (making it tough to reverse engineer what might have caused a penalty)Appears to affect a disproportionate number of web service industry sites (though that could be correlation, not causation)Not yet clear if this a rolling update (though there are signs it may be)Left a lot of very strange, "empty" types of results in many of the spammiest verticals/SERPs

I wanted to crosspost about it here so those asking for my opinions about Penguin could check it out. Look forward to some great discussion on G+ (or here in the comments). Oh, and if you haven't encircled SEOmoz on Google+... You totally should! We've got another WB+ video coming out very soon :-)


View the original article here

Are You Setting Up WordPress For SEO Success?

The author's posts are entirely his or her own (excluding the unlikely event of hypnosis) and may not always reflect the views of SEOmoz, Inc.

what chu talkin bout wordpress

He is talking about WordPress, yes?

If you've ever tried to optimize WordPress for SEO success you've probably said those exact words at some point... some crazy theme breaks something, or a plugin crashes the whole site, or in terms of SEO you get 971 duplicate pages back from your crawl report. 

But I don't think your troubles with WordPress are your fault entirely. I've been there too when I was first learning it! Gary Coleman has been there. But this post is an opportunity to move on from that...

There's a lot of well meaning yet misguided info out there. After over two years of battling with (umm... using...) WordPress, I know it can be tricky and frustrating at times, and so I wanted to create a guide that might help clear some of this up.

I'm not here to get into every single little detail and variation, but rather to spend time on the core WordPress features and give special focus on SEO related WordPress issues.

Clear up some confusion about WordPress terminologyExplain that WordPress, being a dynamic CMS, is built on relationships (as in "relational database") - and explain those relationshipsShow you some hands on, practical tips for setting up your WordPress site with an SEO focusGive you a few ways to cross check SEOmoz's crawler diagnostics with other sourcesGet rid of that 'ol Gary Coleman look!

For This Post, Let's Assume

We're running wordpress.org (the self hosted version)This is a single author site (to keep it simple, although not hard to extend the concepts to multi-author)We're not doing any ecommerce, photo galleries, or anything else you'd find in a more custom application of WordPress.We're using Yoast's SEO for WordPress plugin.

Alright. Everyone ready? LET'S GO!! ....What Chu TALKIN' Bout WordPress?!

Explanation of some of the most common terms

Let's get really basic here for a minute, hope you don't mind. But I think a lot of people may confuse/interchange a WordPress page with a Web Page.

A web page is a single HTML document that exists at a unique URL. Even if the extension is .php or .asp. The underlying source code is still HTML. This is a WEB page. It does not matter HOW it was created - it loads in your browser as an HTML document and that's all you need to know. And for the rest of this post, when I say "web page" I'm talking about any HTML document existing at a URL.

But a WordPress page is WordPress's version of a "static" page. In fact, anytime you're talking about a page in the context of WordPress, put the word "static" before "page" = "static page" and it will always make more sense.

This is the second thing people either usually confuse, or have a hard time grasping. To your credit, I think it's confusing that they're put side by side in documentation, as if they're somehow similar. They're not at all!

post vs page

Note that pages and posts differ entirely in how they function.

A post is dated and "time-sensitive" and a page is not.A post can belong to categories, tags, dates and authors and a page can not.You can access a post from multiple pages - its category, tag, date or author.A page is only accessible from where ever you link to it.

Some additional references about pages vs. posts:

Ah. Another sticky point for folks. Some may argue, but I think Yoast would agree. Categories are for your main 5-7 "buckets" of topics that your posts fall into. Tags are there to fine-tune categories, and are usually much more specific that categories.

categories vs tags

Also, you should NOT have a category that is the same as a tag or vice versa. Categories should all be unique from tags.And, categories can have hierarchy and tags have no hierarchy.

author archives

dated archives

pagination

Yeah... why is this confusing? The only thing that doesn't paginate... are PAGES!!  ....WHAT CHU TALKIN' BOUT??'

This part will show you how the different elements within WordPress relate and interact with one another.

Not much to 'splain here (I hope by now!).

Pages are like regular, non-blog pages on a website.They can have a hierarchy.They will not go into the RSS feed.

Use Pages For The Following Types Of Content

An "About Us" sectionIf a dentist, say a page about "dental implants" describing your service.If a restaurant, your Menu Page.Directions pageFees page etc.

Think of "Many To Many" relationships in databases. 

You can put a post in many categories. And of course a category can hold many posts.

post tag relationship in wordpress

You can put the same tag on many posts.

date an author archives relationships

Dates are simple. If you view a date archive by month, all the posts from that month appear within that date archive.For our single author blog setup, since every single posts is by the same author, that's what you'd get when viewing that archive (which is why we 301 redirect it to the blog homepage).This is showing you, you can arrive at the same post from multiple places.

recent of popular posts access

And this is showing you, for the most recent posts, or popular posts, sometimes there is a link in the sidebar - and of course the blog home IS a feed of the most recent posts.

subpages in wordpress

All of these web pages can have subpages off of them.

Bonus - For the Truly Geeky

I found this awesome template of the hierarchy within WordPress and loading a page. Not necessary to know for what we're doing here, and not 100% relevant either, but I found it really useful, especially if you like to know more about what's going on behind the scenes.

decide this stuff

This is sort of a "I wish I knew then" chart. Things that would be useful to know up front, such as;

Decide your categories at the beginning.Decide what you want the homepage of your blog to be early on.When you create a user account, choose the username wisely, because this is the URL and can not be changed afterwards (don't get stuck with "admin"!)

accessability

This chart is showing you what page types should be accessible to the user and to the search engines.So unless otherwise noted, the page type can be indexed and followed.

where to link to from menus

This is the general rule of thumb I follow for deciding what links to put where. In general

I put pages and categories in main menusI put categories, recent/popular posts, dated archives, and maybe tags in the sidebar/widget.

url title and description control in wordpress

URL control can be confusing, because some are set in odd places, or called "slugs".

Page and Posts URLs get set within the page/post editorCategory and tag URLs get set in their respective menus under "slug"Author URLs are the "username"

If you've got everything set up correctly, it should be EASY to get your titles and descriptions in check.

Title and description templates get set in YoastTitles and descriptions at the individual page/post level are set in that page/post editor with Yoast.Need help writing a title? Use this post I did about writing titles.

This is where things get tricky, because a lot of themes tend to break perfectly good WordPress install. Or they try to handle SEO stuff when they shouldn't. Or, you get a theme, and a plugin and WordPress all handling title tags and it becomes a mess.

DO use themes for design elements;

ColorsFonts,Page layoutHeadersFootersBasic social media button stuff

Do NOT use themes for SEO stuff, such as

IndexationAnalytics codesTitles and descriptionsRSS feedsMenu structure (ideally this is done with WordPress Custom Menus)

Let the Yost SEO plugin handle this stuff! Shut off / do not use these types of SEO functions within the themes. 

There are two plugins I always install right away for pure SEO stuff;

I often see other plugins that try to set SEO settings - so be sure you're only managing SEO with one thing!

yoast title settings

Yoast SEO has the ability to assign a title and description template for every possible page, post and archive - so I advise using Yoast to manage all title and description templates.

As noted: Don't forget to update your header.php file to include the correct title code;

title in header for yoast seo

A note about the 'sitename' variable - this is the site title under settings>general

sitename yoast

indexation setting yoast seo

This follows all of the best practice procedure from above. Tag, author, and date archives will all look too similar to other content. So it does not make sense to have them indexed.Please note: Want to reiterate - this is what I typically use for a standard WordPress setup - one author, standard blogging format, or a business website with a blog inside etc. You may find yourself in a different circumstance if you have multiple authors, ecommerce etc.Also - if your blog has already existed for some time, and you've been indexing tags all along for example, you shouldn't just go deindexing them. Look in analytics, see how much traffic they might be bringing you, if that traffic is quality, and make a well thought out decision about if/how to move away from indexing tags.

archive and robots settings yoast seo

Since running a single author blog, disabling the author archives 301 redirects them back to the blog homepage. This is good for the engines AND the user since they look exactly the same.I like letting users browse posts in the dated archivesNot best practice to add noodp/noydir to every page - but the plugin allows you to do it for individual pages/posts in the editor.

xml sitemap in yoast

Make sure you don't have any other plugins or your theme handling the sitemap.Check off what you don't want included in the XML sitemap. (This is usually the same as what you are NOT indexing).

permalink settings in yoast

One thing I LOVE about Yoast's plugin - you can strip /category/ off the folder structure for categories. AWESOME! You should definitely do this. If the site has already been indexed with /category/ redirects are automatically created.You could redirect images to their parent post or page. I usually don't but it won't do any harm if you do.Unless you're running something with https (secure pages) you can just leave canonical settings as default.

This is THE most common question we get in Q&A. Duplicate content issues. Basically I want to give you guys some extra tools and resources for checking duplicate content issues re: WordPress and the Moz crawl report.

A lot of folks get concerned when they see "47 duplicate page titles found" etc, and with understanding!

If you've set everything up as above correctly, there isn't a whole lot of room for error. But sometimes things happen and stuff breaks or we miss something.

And most times, no matter the issue, ensuring you have things setup as described above in the post, will fix things.

Check webmaster tools. If they are not reporting duplicate page titles or descriptions, you probably have little to worry about. Moz might have picked up on pages that were crawlable but not being indexed. But definitely check back in with webmaster tools in a week or so (its healthy to check webmaster tools once a week anyway!)

I honestly love the Moz crawl report. Its turned up some important things to fix for me at times. Yet I think its just smart with ANY tool to cross check, especially if it involves a big error like duplicate content.

Use the free version of Screaming Frog to crawl up to 500 pages (and the paid version is unlimited).

Crawl the siteClick on titlesSelect DuplicatesYou'll see a report like this:

duplicate titles subpages

In this case we can clearly see subpages are causing a lot of the duplicate title issues. 

Just because a crawler like Screaming Frog or the SEOmoz crawler crawls pages, does not mean they are indexed. Check Google's index to find out with these queries.

site:mydomain.com/blog - check for blog indexationsite:mydomain.com/category - check for category indexation (unless you've stripped from folder structure)site:mydomain.com/tag - check to see what tags are indexedsite:mydomain.com/author - check to see if author archives are indexedsite:mydomain.com/2012 - check to see what dated archives from 2012site:mydomain.com/ inurl:page - check for subpages being indexed (see example below)

subpages indexed query

If you also find errors in webmaster tools, screaming frog, or Google's index:

Identify which page type it is (category, tag, dated archive, author archive, or subpages)Determine if the page should be indexed to begin with.If it should be indexed, make sure you have a setting in WordPress to generate unique titles/descriptions from the template.If it should NOT be indexed, block it using Yoast and be sure you don't have to do any 301 redirects

I know that's a little overly simplistic - it'd be tough to cover every possible variation of errors within this post - but that general framework is what I would advise to follow.

gary coleman wordpress dance

No seriously. I know WordPress can be challenging - but I hope this guide has helped give you a better understanding of its different functions, and how to resolve some common issues on your own.

Got questions? If you lead them with "What chu talkin' bout!?" I'll answer (within reason - only short 3-4 sentence answers possible here). NO specific site questions here please, just general concept questions.

Please take any detailed or site-specific questions on over to the Moz Q&A.

Or... ask me questions at MozCon! That's right, I'll be at MozCon, as an attendee, so if you're there you can track me down and ask away!

Thhhannnnnks!


View the original article here

The Noob Guide to Link Building

The author's posts are entirely his or her own (excluding the unlikely event of hypnosis) and may not always reflect the views of SEOmoz, Inc.

NOOB GUIDE TO LINK BUILDING

The Noob Guide to Online Marketing is arguably the greatest single post of all time. If you don’t agree, well, it’s at least my favorite. Oli Gardner (of Unbounce) displayed a playful writing style mixed with pixel perfect graphic design, and a GPS of a roadmap to take your site from mile marker zero to one hundred in six months. It’s nothing short of amazing.

While savvy content marketers realize that many of Oli’s tactics will naturally attract links, fledgling link builders got to the 63rd page and were still wondering what to do. With this companion piece, it is my goal to grab the baton from Oli and outline a six-month link building action plan for your brand or client’s new web property. Even if the website isn’t brand spanking new, that’s fine, what I really mean is that this is the link building plan for the less savvy looking to do dive into off page optimization. Marketers with long existing sites and more link building experience will be better served downloading the Complete Six-month Off Page SEO Gameplan from iAcquire.

Following this guide in concert with Oli’s you will identify your audience, build a list of prospects, plan and execute four successful pieces of content and convince influencers to create content for your site.

Download both Link Building Guides

Since we last spoke I left Publicis Modem to become the Director of Inbound Marketing at iAcquire which is a technology-focused off page seo agency. I encourage you to read the “Quantifying Outreach” study that I released at LinkLove London wherein I examined nearly 300k outreach emails from both our own iRank platform and Buzzstream’s CRM software. The study will help you optimize your outreach emailing tactics and understand why treating people like people rather than prospects is a far more effective tactic than sending form letters.

For those keeping score at home this falls under both the Content Strategy/Development and Social Strategy phases of the New SEO Process.

Link Building Philosophy

For many, link building is a numbers game and it quickly becomes clear why those people would rather put their resources into black hat tactics. Those marketers are too impatient to properly build links because link building is a process wherein you are convincing people who don’t know you to take a real world action that benefits you. To do that at scale requires a budget, great understanding of people, a large outreach team and a commitment to creating content that people will actually be compelled to link to or embed into their sites. In other words, you either have to make friends or make news.

In this age of 2pac Holograms, stop motion action figure videos, and augmented reality utilities how do you compete? While that type of content is awesome, it represents the type of big swings that may not be in your wheelhouse or relevant to your brand/client so often people wonder how to build links in their otherwise boring niche.

Naturally, there are ways to manually submit your site to millions of forums, blog comments, and directories, but those links are generally very low quality and have been the focus of algorithm updates such as Penguin. That is not to say that these tactics don’t work, but just as you should diversify your traffic sources beyond just Search, you will want to diversify your link building tactics to build a varied and natural link portfolio.

Sites with unnatural link profiles create a footprint that is easy to identify from a 10,000 foot view and of course Google has that perspective. Don’t put yourself on their radar by engaging in spammy tactics.

Anchor Text Distribution

This metric, which is the number of times an anchor in your backlink profile occurs, is best measured using tools like Open Site Explorer, MajesticSEO, Ahrefs, etc., is very important. An ex-Googler told me at SMX Australia to always be sure the highest occurring anchor text for a site is branded otherwise you may trigger an algorithmic filter or penalty.

Watch out for black hats!

Link Equity

The value of the links you build to your site is not a trivial thing. Links are the lifeblood of Search campaigns and therefore the foundation on which every site that is visible in the Search Engine Results Pages (SERPs) must be built.

Link Building Pro Tip: Share of Voice

These numbers will not be an exact determination of what you will need to accomplish as there is a sliding scale of worth for links, but this crude explanation will make it clear to higher ups what needs to be done and why.

Use the aforementioned backlink profiling tools to determine how many links your competitors have and how many links you have. Whether you only use one tool or all tools, make sure that you stay consistent. Backlink profile tools all measure a different portion of the web and none of them are as comprehensive as Google so it’s important to use the same source(s) to capture a snapshot for every site.
Calculate Share of Voice.Share of Voice Equation
Share of voice is a traditional advertising term that basically means of the percentage of opportunity that a given brand occupies. If there are 10 TV advertising slots for a given TV show and a company ran ads that filled five of those slots for that show, they have a 50% Share of Voice.

Ham Sandwich SERP

For the keyword [ham sandwich] the site HamSandwichMusic.Com has the highest SoV because it’s in the #1 position. The keyword [ham sandwich] has a local search volume of 33,100 which means the largest amount of traffic you can get from that keyword (according to industry standards) is 18.2% which is 6024.2 visits. So if my traffic for that keyword monthly is only 500 visits, I have an 8.3% SoV for [ham sandwich].

Determine the number of links required for you to beat your competitors. That is to say if the first position has 100 links and you have five, make the case that it will take you 96 links to get the share of voice that your top competitor has. Again, this is not exactly indicative of what it will take to beat your competitor because you may surpass them with less links that are higher quality or it may require more links and competitors will continue to build links, but to build an easily understood case use share of voice.

Justifying Share of Voice

Tools

The specific tools required for each month are called out in this guide however there are tools that may be used at any point for a variety of reasons. Every tool mentioned in this guide is free or at least has a free tier and/or a free trial that will allow you at least start working.

Rapportive – This Gmail Plugin is for identifying what users social profiles are connected to their email address. This is key because reaching out to people via social media first is far more effective than emailing them first.Boomerang – This Gmail plugin allows you to schedule and automatically follow up on emails.Microsoft Excel – This spreadsheet application is in invaluable for collecting, slicing and dicing data. Get to know it well with Distilled’s Excel Guide to SEO.SEOmoz Toolbar – The SEOmoz toolbar is a must-have for a variety of reasons, but in the case of building links you can find out whether a site meets your SEO metric requirements as you go.Scraper for Chrome – Stop copy and pasting data one piece at a time, use this plugin to quickly scrape data off a page and port it right to Google Docs.Google Refine – The data you collect will often be wonky or at least not in the format that you want it to be in and for some things Excel just doesn’t cut it. This tool by Google helps you clean it up.SpyOnWeb – Using this tool you can quickly figure out what domain names or IPs share the same space and what sites have the same AdSense ID. In other words you can use SpyOnWeb to identify whether your prospect is a part of a link farm.Link Detective – This link classification tool quickly lets you determine what types of links you or your competitor has. Take a CSV from Open Site Explorer and throw it into the system and find out the makeup of a link profile.Link Indices – A link index gives you the links that are linking to a given page or site. This is invaluable intel for tracking progress and identifying opportunities and deficiencies. Each of the following link indices has its own strengths and weaknesses, so give them all a try and determine which one fits your needs the best just like Branko Rihtman did:Open Site ExplorerMajesticSEOAhrefs

 Description: C:\Users\mike\AppData\Local\Microsoft\Windows\Temporary Internet Files\Content.Outlook\AQH9GQKD\title-planning-quick-hits-alt.png

This month in the Noob Guide to Online Marketing Guide you are setting up your online presence and the tools to measure it. Simultaneously, you should be planning out your link building campaign.

Oli’s guide talks about launching an editorial calendar at Month Two, but I suggest you start thinking about when you’re rolling your content out before you start anything. You should of course be thinking about what’s hitting the blog when, but content takes time and careful planning to create and then launch.

When planning your link building you should do what it takes to differentiate yourself and your content because you are competing with the whole web. Making your content stand out will make building links substantially easier.

With that in mind you will be launching the following content:

Egobait (Month Two) – This is content that includes and flatters your key influencers to in turn encourage them to participate by linking to or sharing your digital assets.Data Visualization (Month Three) – Everyone’s go-to version of this is an infographic, but consider other ways of visualizing the data such as a book (Storybird), a timeline (dipity) or a videographic if possible. In month three you will be launching your own.

Google Don't Be Evil Video

E-Book and Guest Post (Month Four) – By the time the fourth month rolls around, the key influencers in your space should know about you or your client and what you bring to the table. At this point an e-book is great way to solidify you or your client’s thought leadership. Much like this guide and Oli’s, you will be coupling that e-book with a blog post on an influential site in your space to be sure that it is seen.

The content should have a centralized theme that links back to your business goals, but you want it to be different enough so that your growing audience will want to share all of it.

Audience/Influencer Personas – Understand who your target audience is by creating personas. Personas are general representations of your audience. Typically you should create four personas for the whole site campaign and while you can make as you like, four are more easily managed.

These personas don’t have to be particularly in-depth (for example we don’t need to determine their need states) nor validated through measurement, they just have to help you create a mental model for content ideation, prospecting and outreach. While a more in-depth form of this falls under the Opportunity Discovery model, for the purposes of link building a basic understanding of who is potentially out there is all that is required. This will make it easier to associate concepts and keywords with one segment of the audience rather than another and then ultimately allow you to tailor outreach specifically to that segment in order to scale the messages. Social Listening – Use tools like Topsy, Social Mention, Amplicate and Google Discussion Search to identify user segments based on keyword searches and usage. Starting from basic keyword searches that you have identified in the Adwords Keyword Tool identify who is contributing to that conversation. In cases where the conversation is particularly spammy, download the tweets and clean them up using Google Refine.
Facebook Ad Creator – The Facebook Ad Creator is to audience research what the Google Adwords Keyword Tool is to Keyword Research. It is an invaluable tool to understand the inventory of people that fit a very precise demographic. This tool will be the validation required for the personas you are building. If you hypothesize from social listening that there is a user segment out there between 30 and 40 and is interested in search engine optimization, you can then take that to the Facebook Ad Creator and see how valid that is. The only limitation is that you can’t examine the interests together using “AND.” You can only use examine the inventory using “OR.”

Facebook Ad Creator

Prospecting – Finding people who will be interested in your content has never been easier. Now that you have identified the type of people that you want links from and split them into targets and influencers you have a fairly precise idea of what they are into. Break these into defining keywords and run searches in:

You can also prospect specifically for sites within Google by using a variety of search queries. Additionally, there are prospecting tools such as Citation Labs and Ontolo, but these require paid subscriptions.

Keep track of these users by persona type so you can later segment your outreach and create more directly tailored form letters — if that is your thing. If you’re prospecting specifically by site you can also keep track of those sites by the segment that those sites target.

Original Content Ideation – Even with an incredible Creative team amazing content is not scalable. Even Don Draper’s team had far more misses than makes, but what Sterling Cooper didn’t have is Social Media. GoFish – This is a tool that I built to perform real-time keyword research in order to help identify co-relevant ideas that are already occurring on Twitter and gives you a list of users that have tweeted those keywords. I gave an example in my “Targeting Humans” talk where I put in the keyword “Michael Jackson” and it identified the keywords “Jackson trial” and “south park” as occurring together very often. A content idea with a built-in audience would be “The Michael Jackson Doctor Trial” played out with South Park characters. More information on how this tool works can be found in my “Using Social Media to Get Ahead of Search Demand” post.
Facebook Ad Creator – You might be surprised to see this tool twice. In addition to being great for segment inventory it’s also great for inferring content ideas that will resonate with those people. For example if we found that target segment of XXX is interested in [A].[B],[C] can we can infer that they would be interested in a __________________. This is the same idea as with GoFish, but it allows for more evergreen concepts.
Pre-Contacting – At this point you have your ideas and a strong list of prospects. Strike up a conversation and tell them about the content you’re working on. Ask them for feedback due to their expertise or interest in the subject matter. Update them regularly along the way to make sure they are still onboard so when the content launches you will have a warm rapport with these people. Keep track of who was interested in your spreadsheet.

The following quick hits can be done at any point in any month. Many of them are one-offs, others will benefit from continued engagement. They are placed here so that you can get wins and show movement to the powers that be in order to get continued buy-in for your link building campaigns.

Get Indexed Fast with Google+

Ask for Links in Mailing List Signups – Encourage users to link to your website when sending out the confirmation email for your mailing asking them or including embed code.
Profile Links – Many Web 2.0 profiles allow you to implement a “DoFollow” link or links that pass link equity. When claiming your brand on social networks be sure to include links from your profile. Facebook Fan PageTwitterYouTubeGoogle+ Profile
Directory Submission – Directory submissions are generally considered the lowest form of link building and generally won’t help you move the needle. That said you can easily identify directories with keyword searches plus directory type e.g. “3d tv rss directory.” Again, you can show numbers with these, but they are generally not worth your time. RSS DirectoriesNiche DirectoriesPaid DirectoriesArticle DirectoriesBlog DirectoriesDesign Galleries
Better Business Bureau – The BBB has a site with an incredibly high domain authority, it’s a widely trusted site in both the eyes of the search engines and the consumer. Getting linked from this site is fairly easy and very valuable.
Chamber of Commerce – Similar to the BBB, the local chamber of commerce provides the same type of value and ease for obtaining links.
Join the Conversation – Typically this is called “link dropping,” but that is only in cases where people place links without joining in the conversation contextually. If you are adding to the conversation (as seen on SEOmoz posts) this is a viable tactic for link building. Message Boards/Blogs – Find these with keyword searches in Google’s discussion search and Social Mention.QuoraLinkedInYahoo Answers

This month in the Noob Guide to Online Marketing Guide you are building your social media followings and soon you will be expected to seed users. That’s much easier said than done, but one of the best ways to do that is by getting your influencers to put your content in front of their users. The best way to accomplish this is by launching egobait which is content made to flatter, include or get the attention of specific users while also providing utility to your audience,

Crowd Sourced Posts – Reach out to influencers and ask them what they think about current events. This makes them feel important and allows them to voice their opinion on topics without having to write a full post themselves and truthfully allows you to create content with little effort.
Best of Lists – Curate a list of the greatest content, writers, things in your space and give props to your target influencers. Another way to quickly make content that will make the rounds and their audience will start to engage with you.
Interviews – Have a Q&A session with a popular thought leader over the phone using Google Voice and you will quickly have a transcript of your interview and your influencer will have piece of content they are proud to share and link to.
Description: PF Comic #3 - Mo' Money featuring J. Money from Budgets are SexyAwards/Badges – Gillian Muessig said it best “no one gave us the authority to launch the Web 2.0 awards, but we did and people still try to enter them to this day.” Launch your own awards in your space and send the people you want to link to you badges that they can use to link back to you. You can take this even further by launching an unbranded microsite and then tricking you competitors into linking to you.X-Factor – There’s an infinite amount of ways to flatter people. Without picking up a copy of the Art of Seduction I’m sure you can creatively figure out how to get influencers to come out to play. Here’s an example from a Credit Card Finder site in Australia where they created a comic book series starring the top financial bloggers in their space.

This is a great opportunity to revitalize a dormant content idea that never quite took off. Do you have creative content that fell to the wayside because the characters never took a life of their own? Take your influencers and breathe new life into an old idea by making them the stars.

Month 3: Data Visualization

What Happens Online When You Die?

The infographic is largely misunderstood as a piece of automatically viral content. The reality is that the infographic has been done to death so unless you have an active built-in community yours requires a substantial launch plan and push. The Noob Guide to Online Marketing Guide places the launch of an infographic at Month Six, but to get the most mileage out of it as a noob, I’m placing data visualization at Month Three. Feel free to launch an additional infographic at Month Six.

In 2012, data visualization should be presented as a Maximum Viable Product. A great example of such is an Australian site in the life insurance space called Life Insurance Finder who launched a very impressive and successful piece of link bait called “What Happens Online When You Die.”

Don’t just build the infographic, build a data visualization experience that exhausts the available digital assets. Create a video counterpart to that infographic, and highlight the source data to build a higher barrier to entry for those that will look to steal your success. Even right now you’re thinking that’s a lot of work; well think of how competitors will feel once that work is finished. Zappos has over 30,000 videos for its products; it is very difficult to compete with Zappos in video search because they got there first. Be the Zappos of the topic you visualize.

Link Building Pro Tip: Maximum Viable Products

Simply put, some people want infographics, some people want a page and some people want a video and the performance of the “What Happens When You Die Online” campaign is very indicative of that. Here are the numbers:

Page – 258 Links, 86 Root Domains, 172 Facebook Shares, 145 Tweets, 51 +1sVideo – 81,072 views, 2071 Facebook Shares, 69 links, 10 Root Domains, 2601 Tweets, 61 +1sInfographic – 86 Links, 28 Root Domains, 31 Facebook Shares, 99 Tweets, 16 +1s

Make it easy to link to you, give the people what they want.

Seed to Your Influencers – You have pre-contacted influencers for this very reason. Spell out their involvement in your content so they are compelled to link to it, share it and otherwise endorse it.
Reach out to your Targets – The list of prospects you’ve created have been waiting for this moment. Ideally, you will reach out to them first in social media and then escalate to email.
Contact Those Already Linking to You – Pull your existing links using a link index tool and then inform all of the relevant people that already link to you that you have launched a new piece of incredible content.
Leverage Your Mailing List – At this point your mailing list should be full of people that want to find out about your content. Inform them of your new content and include the relevant embed code to make it easy for users to link to you.
Add to the Conversation – Head back to the forums, blogs, message boards and Q&A sites and contextually add to the on-going conversation and when it makes sense add a link to your content.

Month 4 - Release An Ebook

By now you and/or your team have written the most incredible e-book your niche has ever seen with the best graphic design and interesting if not new insights on your subject. Luckily, you’ve saved some room for your new influential buddies to get a piece of the action and enough tangential or cutting room floor content to spread it around and get the most mileage out of it.

Exclusive Release – Align with an influential site in your space that gets a lot of traffic and offer your E-Book as an exclusive download.
Guest Post – A good guest posting opportunity typically serves two purposes. First, you are writing content for a third-party website wherein you can drop as many exact match anchor text links as you like. Secondly, you have an opportunity to leverage the eyeballs of users that frequent the site.
Foreword – Invite a key influencer to write the foreword of the book so their name can be attached to your promotions in social media and on other influential websites.
Quotes – Reach out to thought leaders in the space for quotes, similar to the pre-contacting done in the first month asking thought leaders for quotes keeps them aware of the process that you are making a book and once the book arrives you can easily reach out to those influencers and request a link or promotional support.
Reviews – Reach out to bloggers that specifically write book reviews in your space. Simply search for “[keyword] book review.” For example, if I were to write an SEO e-book I would certainly reach out to this gentleman, Ian Lurie, since he has reviewed an book about SEO in the past.
Ian Lurie is a good gentleman into SEO Books
(I just egobaited Ian here. Now when I tweet about it I can put "featuring @portenint" in the tweet)

Month 5: Host a Blog Contest

In Month Six Oli suggests holding a contest in social media. I’m going to move that up one month in order to couple it with the event you will be throwing in Month Six. I’m also going to take another page out of Oli’s book and suggest this be a blogging contest.

The concept is quite simple:

Reach out to influencers in your space that are awesome writersConvince them to write their best workOffer awesome prizesDecide the winners based on social metrics

You may be few thousand dollars lighter from the prizes you pay out, but you also have a ton of great content from thought leaders in your space which then turns into more linkable assets. You also have a ton of social shares that put your content in front of those influencers’ followers in social media.

Does it work? Well Unbounce ran the same contest and here’s the leaderboard:

Conversionfest 2011 Leaderboard

The posts led to a combined 20,000 unique visitors during the respective two week scoring periods of each post and twenty one posts that continue to drive substantial traffic and links for Unbounce.

At this point the content on your site will be robust enough to make linking to you easy and worthwhile. The following tactics will allow you to continually identify contextual prospects and grease the wheels for any ongoing outreach link building:

Set up Google Alerts – Use your brand name, the names of people involved in the business and target keywords as the queries that you are targeting in Google Alerts. If someone has mentioned you and not linked to you, quickly ask them for a link.
Ifttt– Set up automatic alerts using If That Then This for when the brand, keywords or guest post URLs are mentioned in social media. Reach out to those people and encourage them to link to you.
LinkStant – Find out when someone is linking to you as they are writing their post and ask for updated anchor text.
Image Search – Google’s image search allows you to search for sites that have embedded your infographic and request that they cite their source by linking to you.
Video Search – Sometimes people re-upload videos to YouTube and that causes the views to be split between them. Search for those videos on YouTube and then search for the URL in Google to find sites that have embedded your video and request that they switch videos and link to you.

Once the competition is over, revisit your badge strategy by sending all entrants a badge and encouraging them to link back to their post.

Month 6: Throw an Event

You may have heard that the best way to get someone to link to you is to buy them a beer; throwing an event is the scaled version of that. Throwing a successful event naturally generates a lot of fanfare, promotion and chatter that will also lead to links.

First, you must decide what type of event you want to put on:

Ask People You Know To Link To You – The people that you specifically invite to your event should be the type of people that you want to link to you. Show them a good time and encourage them to write about your event after the fact.

Press Releases - These have been abused by SEOs on PRWeb and Businesswire for content launches and link building but they are most effective in the case of launching an event.
Reach Out Directly to Journalists – Find journalists with the keyword searches “columnist for [publication]” and “writes for [publication].”Invite these people out and show them a great time.
Handwritten Notes – Follow up after the event is over with handwritten notes so potential linkers remember you and your hospitality.The SEER Method - For your grand sendoff use the SEER Interactive method; pull your full list of followers using SimplyMeasured and pull your complete backlink profile with MajesticSEO, expand any shortened URLs and do VLOOKUPs to determine what users are following you, but not linking to you and reach out to them.
Find Who Shared Your Stuff – Similarly, using Topsy, pull the profiles for all the people that shared your URLs and perform a VLOOKUP to see what users shared your content but didn’t link to you and reach out to them.

Presumably, you’ve made it through not one, but two guides on how to successfully launch a new web property and ultimately get visibility not just in the SERPs but amongst key influencers in your vertical. I hope now that link building doesn’t seem quite as daunting as it once did and I wish you great success!

Remember there are two solid ways to build links: Make News or Make Friends.

Which one are you prepared to do?


View the original article here

The Real Impact of the Google SmartPhone Crawler (Part 3): Avoiding Mobile Mis-Indexing

This is the third and final installment in this mobile SEO blog post series, covering the impact of the new Google smartphone bot and how you can use it to make the most of your mobile content. The first article in the series discussed how the new smartphone bot works and which sites will be most affected. The post from last week discussed how to author redirects correctly to ensure that your mobile content will be properly indexed by the smartphone bot. This final post will review common search engine indexing problems that mobile sites and mobile platforms have, and how you can prevent them.

Some SEOs insist that we must believe what Google tells us about how and why they index things the way they do; that indexing is consistent, predictable and flawless. Unfortunately, that is not the case, especially in mobile where there are more pages and more potential for things to go wrong. Believing that indexing will always happen correctly, and that you need not mitigate risk factors for mis-indexing will not create the ideal SEO scenario. It will leave your sites (mobile/desktop/tablet) exposed when there are changes to the algorithm, or when new crawlers are evaluating your site for the first time.

If you are trying to ‘dot all your ‘i’s’ and cross all your ‘t’s’ in the world of mobile search engine indexing, here is what you need to know to prevent mis-indexing:

Google has never and will never like duplicate content. Google’s new smartphone bot, and their decision to index and cache mobile redirects may be a way for Google to avoid or minimize the need to index entire mobile pages (possibly), but it is still hard to tell how it all works. Adding mobile pages into a mix will always presents the RISK that something will be misunderstood as ‘duplicate’ and cause problems.

To keep Google happy, in the mobile world it is especially important to avoid the sneakier kinds of duplicate content that some webmasters forget about, otherwise known as DUST. The acronym stands for Duplicate Url Same Text [Acronym shared with my by the awesome Lindsay Perkin-Wassle, of Keyphrasiology]. DUST happens any time more than one version of a URL will resolve in the address bar but the browser shows the same page. The easiest example to understand is a page rendering with or without inclusion of the ‘www’ in the URL (the canonical v. non-canonical discussion usually stops here), but DUST can also be seen when there are multiple versions of a home page or category level page, as in the examples below:

http://www.yoursite.com/
http://yoursite.com/
http://www.yoursite.com/index.asp
http://yoursite.com/index.asp

It is quite common for sites to allow all four of these URLs to be linked to or typed into the address bar so that the home page will be served. (This can happen at category level pages too, like
www.yoursite.com/cindy and
www.yoursite.com/cindy/index.asp)

http://m.yoursite.com/
http://m.yoursite.com/index.asp

Adding mobile pages to the mix makes this even more confusing and cumbersome for Google.

In the mobile SEO world, it is quite common for mobilization platforms to control the servers and databases that generate the mobile content, and they are infamous (maybe only in my mind) for generating lots of DUST. Even the best mobilization platforms typically have minimal understanding of SEO; they try to set their servers to be very flexible with what page requests they can correctly render, and render as many different variations of a URL as possible. Instead of doing this, the platforms should be setting up the servers to 301 redirect any version of the URL that is not the canonical ‘chosen’ to redirect to the ‘chosen’ version of the URL. This is also how you can set up your own servers to prevent DUST.

There is a risk that Google’s new smartphone crawler may be overly literal at first, and rely exclusively on the redirects that are in place, but not evaluate other signals or algorithmic elements. This means that it will probably also have a heightened the sensitivity to errors that are present on a site or in a redirect.

In general having lots of errors on your site can hinder crawling and indexing and cast your mobile site (and possibly desktop site too) in a bad light. Be sure that you check the content frequently for indexed 404 errors in Webmaster Tools, especially if you are generating dynamic mobile pages or using a hosted mobile solution to generate your mobile pages. To make finding and fixing 404s easy, you should set your mobile content up in a separate Webmaster Tools account. This way, you can see just the errors and information related to the mobile content, and not have to subtract out desktop figures to generate meaningful information.

Many 404’s in mobilization platforms are caused by improperly expired mobile content, but you should also watch for 404 errors caused by a lack of capitalization normalization and trailing slash rules set up on the server. See the example below, where one version of a URL is working fine, but the same URL with a capital letter is understood as missing, and being redirected to the mobile home page. (This is also DUST – your server can automatically normalize URLs to remove capitals.)

Capital letters in the URL cause a 404 or redirect to the mobile home page:

Actual URL:
URL with a Capital ‘C’:

http://m.yoursite.com/cindy/
http://m.yoursite.com/Cindy/

Successful
404 Error or Redirect to Home Page

The presence or absence of trailing slashes can also cause problems, as shown below:

Actual URL:
URL with a Trailing Slash:

http://m.yoursite.com/cindy
http://m.yoursite.com/cindy/

Successful
404 Error or Redirect to Home Page

Whether the page is 404 or just redirecting to the home page, this is a problem. Stuff like this REALLY happens all the time, especially when the mobilization platforms are in charge of the server, so if you are working with an external mobilization vendor, go check this stuff out when you are done reading the article. Error-based redirect to the home page could be somehow mis-indexed as the mobile redirect.

Mobilization platforms will usually not archive mobilized pages for long periods of time, especially for sites that generate new content on a daily basis, but they also generally don’t have a proper mechanism to expire the content in a way that is good for SEO so a very similar scenario could happen with a 404 error on a page that has expired.. Mobilization platforms will generally just remove the content and leave a 404 error, which makes the mobile site look bad, because as you are constantly generating new content, you are also constantly generating new 404 errors at the same rate.

What if Google took the 404 errors on the mobile pages seriously? What if Google somehow associated the errors on the mobile pages with the corresponding desktop pages even though they were still live and fine? Hopefully Google would not let the 404 on the mobile page drag down the credibility and rankings of the desktop page that was redirecting to it, but it is not worth the risk! If you are worried about it, there is a Mobile SEO Tool to help you check indexing of one domain across the desktop and WAP index. 

When you are optimizing your mobile content, the best bet is to always play it safe, and keep your content and your server settings as neat and tidy as possible. Avoid the risk of mis-indexing by checking your URLs and watching for errors. When you add more pages and more redirects, and potentially even more servers and different companies to the mix to achieve a good mobile user experience, you increase the risk of mis-indexing.

Thanks for tuning in to this mobile SEO series about optimizing for Google’s new smartphone bot! If you missed the previous articles, they cover important information like how the new bot works, which sites will be affected and how to generate the right kinds of redirects to ensure that your content is correctly indexed by the new bot. Good luck with all of your SEO efforts and stay mobile!


View the original article here

The Penguin Update - Whiteboard Friday

SEOmoz and I don't always see eye to eye on industry issues, but I still have a lot of respect for the company. In fact SEOmoz is still the website that I send people to, when they want to learn about SEO or get into our industry. Rand kindly invited me to the SEOmoz office when I was in Seattle this week, for a chat and the opportunity to present a Whiteboard Friday.

This week's Whiteboard Friday covers the recent Penguin Update, including what to do and what not to do. I certainly wouldn't say that it's a comprehensive guide, but it does discuss the issues and causes that I have witnessed. Fortunately Ayima's campaigns have been unaffected (other than increases) by the update, but we do monitor our client's competitors and their agencies to a very granular level using in-house technology. Off-Page SEO has been changing dramatically for a while now, and it's important that agencies and in-house teams don't get left behind. Always ask questions and never just assume that Google whacked you by mistake, even if you are "White Hat".

Hello, and welcome to another Whiteboard Friday. My name is Rob Kerry. I'm co-founder of an SEO agency called Ayima. Today we're going to be talking about the Penguin Update. There's been a lot of talk in a lot of communities out there, a lot of SEO communities, about the Penguin Update. A lot of false information being chucked around out there as well. Hopefully, this video clears up quite a few things.

The first issue is that a lot of people still use the term white hat, grey hat, black hat. Now, this terminology was taken from the hacking world and adopted for SEO reasons. It's actually in Google's best interest for us to use this terminology because it makes SEO sound like a risky, dangerous, almost illegal thing to be doing. Whereas if you actually use the hacking terminology and adapt it to SEO, the only thing that is black hat SEO is hacking someone's website and embedding links into there for SEO reasons. Everything else is basically white hat, because you're either getting permission from another webmaster to have a link on their site, or you're making adaptations to your own website, all of which would be classed as white hat.

Rather than looking at whether you use a white hat SEO provider or a black hat SEO provider, actually have a look to see what techniques are being used. Even if you're not buying links, you can still get affected by the Penguin Update. This isn't an update about whether you are buying links or not buying links. This an update about how you're trying to manipulate Google.

If your white hat SEO provider is currently just putting links into your site for commercial terms or even only putting 50% of the links in using commercial terms, let's say we're trying to rank for the term "penguin," if half your links or more are saying penguin in them, then you're going to get tripped up in this kind of filter because you're seen as manipulating Google, even if those links were acquired through directories or through asking for links or through viral campaigns.

So, rather than looking at that, we need to look at the footprints that are going into your site. Quite a good case study for that is we have a client who works with a lot of seasonal campaigns. We were about to run one at the beginning of this year for an event, which they sell products for. A competitor SEO agency in the UK works with one of their big competitors, one of the big competitors of our client. We were basically monitoring to see what that other SEO agency was doing. Three months before the seasonal campaign needed to launch, they started building links into their client's website using the commercial anchor text, so people putting links in saying penguin, penguin, penguin, going into those client pages. Whereas, we went with a different tactic.

We actually changed the way that we do SEO in terms of off-page SEO about a year ago, predicting that this kind of update would get rolled out. With our clients now, as long as the on-page is optimized properly and there are a few links going in using commercial terms, then we basically just build up the authority and the trust of our client website.

It sounds like kind of a lame idea, and it goes against traditional ideas of SEO, but it does actually work ever since this update rolled out. So, whilst we were starting to go up and up and up in the rankings, eventually hitting number one place for the biggest term for this seasonal campaign, we noticed our competitor going down and down and down.

There are even complaining on Twitter that Google might be broken, there's an algorithm issue, just because they didn't understand why putting loads of anchor text with commercial terms going into the client's site wasn't working. It's basically because Google has been working towards this kind of thing for quite a long time.

So, have a look at your anchor text ratios. Go to Open Site Explorer, type in your website, click on the anchor text link, and that will order it by, I think, group linking domains. You can actually see what links are most used on each URL of your website. If your commercial terms are quite near the top, let's say in the top 10, then you need to really work at getting better links going into your site and maybe even taking down some of the links, which are overly optimized. This is basically their step towards an over-optimization penalty.

There's another thing, which is content providers, who as soon as the Penguin Update rolled out, we got a barrage of emails from all of these people saying, "We can fix Penguin by building lots and lots of more pages of content for your site." These would actually negatively affect you, because one of the things that Penguin's trying to do is further penalize the production of crap content.

Rather than paying thousands and thousands a month to have 200-words news articles put onto your website, get rid of those if they're not actually bringing any traffic in. Look at actually creating a good quality resource of information on your website to become the authority in your industry. A few pages of great content is a lot better than just hammering Google with loads of news articles.

The big thing is there's no quick fix. If you get an email from a company saying that, "We can fix all your Penguin issues," it's likely not to be the case, especially if it's like a $35 fix. You just basically need to build a better campaign for your website. Look at taking down content which might not be unique or useful information. Get rid of some of that from your website if it's not driving any traffic directly to it.

Also, look at just making your website look as natural as possible. Build authority into the pages that you want to rank, but don't start over- optimizing on the anchor text. If you start doing that, not only will it fix Penguin issues, but it will also help you to rise up in the rankings. Thank you very much, and that's about it.


View the original article here