Independent Consultant (Toronto, Canada) specializing in Lucene, Hadoop, HBase, Nutch, SOLR, LingPipe, GATE, Data Mining, Search Engines, WebLogic, Oracle, Liferay Portal, Java, J2EE, SOA, and more.
Master in MATH, Moscow State University n.a.Lomonosov
No Frills Comparison Shopping for Computers Tops My Wish List
Interesting article about Tokenizer, Shopping Price Engine
I wish I can normalize data soon. Some stores like Best Buy show 200 for pages which do not exists; with a short message "Item not found". I already know how to automate mining in this specific case: I'll simply calc number of inbound links. Without mining ;)
For submissions, send Email to the Agent.Submit URL here
Labels: Shopping Price Engine Tokenizer Computer
- this blog is simply the best guide on Flex!
Thank you, Christophe.
MySQL, Trees and Graphs. Is Joe Celko right?!
Interesting article about SKU; it IS a problem for first-generation comparison sites! How to standardize product names? What about ***Free Delivery*** and ***Refurbished*** in product titles?
Last year, November-06, I bought Cheetah 15K.5 SAS, and merchant http://www.directdial.com
didn't know exact product name. Seagate published this SKU on their website only in April 2007!!! I was lucky: using similarities in SKU namings from Seagate, I was able to guess.
Labels: Comparison Engine RSS SKU