Today Google formally announced on its blog that it has completed the launch of the new indexing system dubbed “Caffeine”. Google originally announced the Caffeine indexing infrastructure in August of 2009 and then launched a version of it on a single data center later in the year. Today’s announcement makes it official that Caffeine is live on all Google data centers.
Highlights of the Caffeine Announcement
The size of the Google index is larger than ever
Google claims that search results are 50% fresher, meaning that new content added to the web will appear in search results faster. This does not change the frequency of Google’s indexing of websites; it simply means that new information is pushed live to the index faster than before.
Storage capacity has increased greatly: Google’s Caffeine currently requires 100 million gigabytes of storage (that’s 625,000 of the largest iPods to you and me).
The primary reasons for the update are to deliver fresher content at a near real-time rate and to create a foundation for the future as the internet grows.
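As a quick sanity check on the storage figure in the highlights above, the two numbers quoted (100 million gigabytes and 625,000 iPods) can be divided to see what capacity per iPod they imply. This is only a back-of-the-envelope sketch: the variable names are ours, and the petabyte conversion assumes decimal units (1 PB = 1,000,000 GB).

```python
# Figures quoted in the highlights above.
total_storage_gb = 100_000_000   # "100 million gigabytes"
ipod_count = 625_000             # "625,000 of the largest iPods"

# Capacity of a single iPod implied by those two figures, in gigabytes.
implied_ipod_gb = total_storage_gb / ipod_count

# The same total expressed in petabytes (assumption: 1 PB = 1,000,000 GB).
total_storage_pb = total_storage_gb / 1_000_000

print(f"Implied iPod capacity: {implied_ipod_gb:.0f} GB")
print(f"Total storage: {total_storage_pb:.0f} PB")
```

The result works out to 160 GB per iPod, or roughly 100 petabytes in total, so the comparison is at least internally consistent.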
Impact on Search Engine Optimization
For now, existing search engine optimization best practices will be impacted only slightly, if at all. Google has been adamant that the Caffeine update does not impact search results. However, the company is also very clear that the new indexing architecture gives it more flexibility in the way it stores data and associates data together. In an interview with Vanessa Fox, Matt Cutts stated, “It’s important to realize that caffeine is only a change in our indexing architecture. What’s exciting about Caffeine though is that it allows easier annotation of the information stored with documents, and subsequently can unlock the potential of better ranking in the future with those additional signals.”
Statements like this one, and subsequent statements out of the Googleplex, clearly indicate that Google has increased its potential to make rapid algorithmic changes in response to changes on the internet. As we have mentioned in previous posts, Google increasingly cares about page speed, so we still recommend working to improve the efficiency of your website. In addition, Google’s capacity to consider “additional signals” in the future means that it will be able to weigh many additional factors in its ranking algorithm, including both on-site and off-site factors.