WordPress - irthoughts.wordpress.com - IR Thoughts

Latest News:

Big Data or Big Pitfalls? 19 Aug 2013 | 10:03 pm

Here is a nice article about the risks of misusing big data. Here are my comments on the topic: 1. Most traditional statistical significance analyses were meant to be used with small data sets, not ...

From Searching to Mining 17 Aug 2013 | 07:01 am

It looks like there is light at the end of the tunnel. http://searchenginewatch.com/article/2289568/New-Data-Mining-Tool-Will-Let-You-Make-Your-Own-Private-Search-Engine Filed under: Data Mining, Hu...

On Words, Strings, and Co-Occurrence Studies 2 Aug 2013 | 06:38 pm

On Words and Strings: In “String Frequency Distributions,” Mark Liberman blogs about the flaws that arise when co-occurrence studies are reported without first defining what a “word” is. I a...

J.K. Rowling and the death of the long tail 21 Jul 2013 | 11:07 pm

http://ideas.time.com/2013/07/19/j-k-rowling-and-the-death-of-the-long-tail/ …and the end of a fiasco. Filed under: Data Mining, Statistics and Mathematics

When Orthography impedes Information Retrieval 12 Jul 2013 | 12:14 am

Here is an old, still relevant essay written by T. A. Brooks on orthography as a fundamental impediment to online information retrieval. Some of the problems pointed out by Brooks are still around. h...

“Powered by” in Spanish 14 May 2013 | 11:21 pm

The Problem: When it comes to properly expressing “powered by” on Spanish web pages, a lot of Spanish-speaking users don’t …

Some nice features added to the Image Crawler 13 Apr 2013 | 12:31 am

Some nice features were added today to the Image Crawler in response to requests from current users. Thank you for the feedback. …

The Images Crawler 11 Apr 2013 | 11:18 pm

The Images Crawler has arrived at Mi Islita.com. An easy way to view images from Web documents. Use it to …

A nice service for my locals 8 Apr 2013 | 07:02 pm

Puerto Rico Daily News & Image Searches. Driving traffic to Puerto Rico’s best media sites. The fastest way to find …

An update to the Web Crawler 5 Apr 2013 | 12:20 am

A nice update, indeed: http://www.miislita.com/web-crawler/web-crawler.php Filed under: Data Mining, IR Tools, News, Scripts, Software

Related Keywords:

better whois, betterwhois, tf idf, spearman correlation, seo quack, svd pca lsi, seo qwack, normalize vector, seo vector space model
