All posts by Alistair Lattimore

About Alistair Lattimore

My name is Alistair Lattimore, I'm in my very early 30's and live on the sunny Gold Coast in Australia. I married my high school sweet heart & we've been together for longer than I can remember. Claire and I started our family in September 2008 when Hugo was born and added a gorgeous little girl named Evie in May 2010. You can find me online in the typical hangouts, Google+, Twitter & facebook. .

WordPress Plugin: Kottke Style Archives

Kottke Style Archives, a WordPress archives plugin which emulates http://www.kottke.org/everfreshAre you sick of seeing WordPress archives that stretch the length of your arm? If you are, then you might have just found the next best thing to sliced bread! The Kottke Style Archives plugin is a WordPress archives plugin which will display your yearly archives in a similar format to Jason Kottke.

If you’re wondering why you might want your archive page looking like that, there are a couple of simple explanations:

  • You don’t like having to scroll
  • You need or value your screen real estate
  • You want your site to be more accessible on mobile devices, see point #2
  • You want a change, after all – everyone has the same ol same ol archives page

You can find out all about the Kottke Style Archives plugin at its permanent home. If you have any comments, problems or suggestions – feel free to drop them into this thread.

Google Malware Warning

Google intergrate malware blocking services into their search servicesI was recently searching for information using Google and was suprised to see that they have integrated a malware blocking service into their search results.

As Google are indexing the internet, they are always taking into consideration what content is on your site. They are now using that information to warn their users of a web site which might contain suspect or malicious content.

Personally, I think that it is an excellent service to provide the Google user base. A lot of people who simply ‘use’ computers aren’t aware how easy it is for their computer to become infected with all sorts of nasty stuff. At least if they are confronted with an intermediary page as listed above, it will make the users think twice about viewing the site or using any of the content/services which they may provide.

Wired Purchases Reddit

It seems that in recent times, aquiring cool community driven sites has become the new black. In the latest round of bringing black back into vogue, Wired have stepped up to the plate and have sucessfully struck a deal with Reddit.

For those that aren’t aware, Reddit is a social news site. If you’re asking yourself what that means, its simple; instead of the site owners deciding what is newsworthy – the users of the site submit the news items. To keep the news relevant and fresh, other users can choose to vote a news item up and down in importance. The higher the number of votes a news item receives in a given period, the higher the item will float until it ultimately reaches the #1 story on the site.

Reddit was initially funded through Y!Combinator and to the delight of Paul Graham, it was written in Lisp. Not too long after the site was launched, the owners of the site controversially rewrote Reddit in Python to the horror of the Lisp fanatics. It raised such a noise online, soon other Lisp fanatics were posting articles on how to (re)-write Reddit in 100 lines of Lisp.

The question now is, who is next? Rumors have been flying rampant about Digg.com and it was confirmed that certain parties were interested but that they couldn’t come to a reasonable (read: less than USD$150M) dollar value.

Remembrance Day

Remembrance Day, a solemn day to remember the men and women who gave their lives in war over the last century. It is customary for all Allied countries to serve at least one minutes silence starting at the 11th hour of the 11th day of the 11th month each year.

On this day, I’d like to leave you with a famous poem which is synonymous with Remembrance Day around the world. The poem “In Flanders fields” was written by Lieutenant Colonel John McCrae on May 3rd 1915.

In Flanders fields the poppies blow
Between the crosses, row on row,
That mark our place; and in the sky
The larks, still bravely singing, fly
Scarce heard amid the guns below.

We are the Dead. Short days ago
We lived, felt dawn, saw sunset glow,
Loved, and were loved, and now we lie
In Flanders fields.

Take up our quarrel with the foe:
To you from failing hands we throw
The torch; be yours to hold it high.
If ye break faith with us who die
We shall not sleep, though poppies grow
In Flanders fields.

Please remember to show your respects to the men and women who gave their lives to provide you the luxuries and freedom which you enjoy every day of yours. For those people who find this site and don’t know or understand what this special day is about, please feel free to read about Remembrance Day online.

What Is A Search Engine?

A search engine is typically a software application which is capable of receiving requests and returning results based on a simple human readable text phrase or query. The query is received and the search engine then evaluates the request and attempts to find the most relevant results from its database. The relevancy of the results returned is based on complex rules and algorithms which rank each unique resource in its database. The results of a search request are typically sorted in descending order based on relevance to the search query.

Search Engine Types

There are three main types of search engines:

  1. human generated
  2. automated or algorithm based
  3. a hybrid of the two previous options

A human generated search engine is what is now generally considered a directory. Users submit their sites and the administrators of the directory review and include sites on their discretion. If a particular web site is included into the directory, it is evaluated, categorised and subsequently placed within the directory. The most widely known human generated search engine in existence today is the Open Directory (dmoz.org).

An automated or algorithm based search engine does not rely on humans to provide information for searches to take place on. Instead, an algorithm based search engine relies on other computer programs, known as web crawlers or spiders to provide the data. Once the web crawlers or spiders have provided the data, separate computer programs evaluate and categorise the web sites into the directory.

The hybrid search engines combine both human generated and an algorithm based approach to increase the quality of the search data. In these systems, the internet is crawled and indexed like an automated approach; however the information is reviewed and updated as this process takes place.

Search Engine Strengths & Weaknesses

Each technique described above has its own strength and weaknesses. In a directory style search engine, the quality of the results is often very high due to a physical person reviewing the content on the web site and subsequently taking the appropriate actions. Unfortunately, due to the ever increasing number of web sites and content on the internet, requiring human intervention to rank and categorise a web site doesn’t scale.

In a purely automated approach, the search engines rely on the speed of software applications to index the internet. While the human based approach might allow for tens or possibly hundreds of pages to be categorised simultaneously; a search engine spider or crawler is capable of doing thousands or millions of pages simultaneously. The obvious problem with this approach is that since the search engines rely on algorithms, the algorithms can be exploited. In years gone past, “webmasters” cottoned onto how these type of search engines worked and started abusing the system by including keywords into their site which had nothing to do with the primary focus of the page or domain. The search engine would spider the site and suddenly an online shoe shop is coming up in searches for porn, drugs, gambling and more.

The hybrid based approach attempts to resolve the two aforementioned issues by crawling the internet using software applications and reviewing the results. The algorithms which rate and categorise a particular web site are tuned appropriately over time and the results they produce are monitored very closely to ensure accuracy of the search results. Companies which implement a hybrid based approach have teams of people whose soul purpose is to review the validity of various search results. If they find results which they would consider to be out of place, they are marked for investigation. If the results they expect do not come up, that is also noted down and sites can be manually included into the search index.

Now that you know what a search engine is, keep your eyes peeled for a follow up on how search engines work.