WordPress - irthoughts.wordpress.com - IR Thoughts
Latest News:
Big Data or Big Pitfalls? 19 Aug 2013 | 10:03 pm
Here is a nice article about the risks of misusing big data. Here are my comments on the topic: 1. Most traditional statistically significant analyses were meant to be used with small data sets, not ...
From Searching to Mining 17 Aug 2013 | 07:01 am
It looks like there is light at the end of the tunnel. http://searchenginewatch.com/article/2289568/New-Data-Mining-Tool-Will-Let-You-Make-Your-Own-Private-Search-Engine Filed under: Data Mining, Hu...
On Words, Strings, and Co-Occurrence Studies 2 Aug 2013 | 06:38 pm
On Words and Strings: In “String Frequency Distributions”, Mark Liberman blogs about the flaws involved when co-occurrence studies are reported without first defining what a “word” is. I a...
J.K. Rowling and the death of the long tail 21 Jul 2013 | 11:07 pm
http://ideas.time.com/2013/07/19/j-k-rowling-and-the-death-of-the-long-tail/ …and the end of a fiasco. Filed under: Data Mining, Statistics and Mathematics
When Orthography impedes Information Retrieval 12 Jul 2013 | 12:14 am
Here is an old, still relevant essay written by T. A. Brooks on orthography as a fundamental impediment to online information retrieval. Some of the problems pointed out by Brooks are still around. h...
“Powered by” in Spanish 14 May 2013 | 11:21 pm
The Problem: When it comes to properly expressing “powered by” in Spanish web pages, a lot of Spanish-speaking users don’t …
Some nice features added to the Image Crawler 13 Apr 2013 | 12:31 am
Some nice features were added today to the Image Crawler, in response to requests from current users. Thank you for the feedback. …
The Images Crawler 11 Apr 2013 | 11:18 pm
The Images Crawler has arrived at Mi Islita.com. An easy way to view images from Web documents. Use it to …
A nice service for my locals 8 Apr 2013 | 07:02 pm
Puerto Rico Daily News & Image Searches. Driving traffic to Puerto Rico’s best media sites. The fastest way to find …
An update to the Web Crawler 5 Apr 2013 | 12:20 am
A nice update, indeed: http://www.miislita.com/web-crawler/web-crawler.php Filed under: Data Mining, IR Tools, News, Scripts, Software