What are some ways to have fun with a large amount of data? (e.g., the Twitter, del.icio.us etc. APIs) - Artificial Intelligence

AnkurSethi
April 5, 2009
214 views
3 votes
7 Answers

Twitter, Google, Amazon, del.icio.us etc. all give you a lot of data to play with, all for free. There’s also a lot of textual data available through initiatives like Project Gutenberg. And that, it seems, is just the tip of the iceberg.

I have been wondering how you could use this data for fun. I’m a first-year IT student, so I have no knowledge of statistics, machine learning, collaborative filtering etc. My interest in this area was piqued by the book Programming Collective Intelligence by Toby Segaran, and now I want to take a deeper look at what you can do with data. I don’t know where to start. Any ideas?

I have also been pondering whether I should go and buy something like Paradigms of Artificial Intelligence Programming. Is it worth the trip across the city?

Answers

- anon
- April 5, 2009 at 4:43 pm
- 0 votes
Try firing books in different styles from Project Gutenberg through a Markov chain generator – there’s one in Perl here to get you started.
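For anyone who would rather stay in Python, here is a minimal sketch of the same idea. It is not the Perl generator mentioned above, just an illustration: the function names `build_chain` and `generate`, the `order` parameter, and the tiny sample text are all made up for this example. In practice you would read in one or more plain-text books and concatenate them.

```python
import random
from collections import defaultdict


def build_chain(text, order=2):
    """Map each run of `order` consecutive words to the words seen right after it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        state = tuple(words[i:i + order])
        chain[state].append(words[i + order])
    return chain


def generate(chain, length=30, seed=None):
    """Random-walk the chain and return at most `length` words as one string."""
    rng = random.Random(seed)
    state = rng.choice(sorted(chain))  # pick a starting state (chain must be non-empty)
    output = list(state)
    while len(output) < length:
        followers = chain.get(state)
        if not followers:  # dead end: this word pair never had a successor
            break
        output.append(rng.choice(followers))
        state = tuple(output[-len(state):])
    return " ".join(output[:length])


# Made-up sample text; swap in the contents of real books.
sample = (
    "the cat sat on the mat and the dog sat on the rug "
    "and the cat saw the dog and the dog saw the cat"
)
chain = build_chain(sample, order=2)
print(generate(chain, length=15, seed=1))
```

Building one chain from two authors' texts is what produces the mixed-style output: wherever both books share a word pair, the walk can hop from one to the other. A larger `order` gives more coherent but less surprising text.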


- JohnFarrell
- April 5, 2009 at 4:51 pm
- 0 votes
Visualizations, do them, share them.


- RobertGould
- April 5, 2009 at 5:14 pm
- 0 votes
You can make puzzles like hangman games. Or build a mashup, or try Yahoo Pipes to join information from several sources.


- sep332
- April 10, 2009 at 10:10 pm
- 0 votes
You can use some of that data to make money (if you’re really good!)
http://www.netflixprize.com/ Netflix has made an anonymized dataset available and is asking for better algorithms to predict customer choices.


- timday
- April 10, 2009 at 10:26 pm
- 0 votes
Predict future stock market trends from the data. Profit!


- theycallmemorty
- April 13, 2009 at 4:55 am
- 0 votes
If you’re familiar with Python, try playing around with NLTK (the Natural Language Toolkit). It has tons of modules for text mining and even machine learning in general. Try working your way through the NLTK book.


- JamesVanBoxtel
- April 20, 2009 at 8:12 pm
- 0 votes
If you want to start off with an easy AI problem, you might try clustering.

http://en.wikipedia.org/wiki/Data_clustering

You could use it to group Flickr images together by tag or something cool like that.
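To make that concrete, here is a rough sketch of one simple approach: single-link agglomerative clustering on tag overlap (Jaccard similarity). It does not call the Flickr API; the `photos` dictionary, the image names, and the 0.3 threshold are invented for illustration, and you would fill the dictionary from whatever tag data you actually fetch.

```python
from itertools import combinations


def jaccard(a, b):
    """Fraction of tags two sets share (0.0 = nothing in common, 1.0 = identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_by_tags(items, threshold=0.3):
    """Single-link clustering: keep merging any two clusters that contain
    at least one pair of items whose tag similarity reaches `threshold`."""
    clusters = [[name] for name in items]
    merged = True
    while merged:
        merged = False
        for a, b in combinations(clusters, 2):
            if any(jaccard(items[x], items[y]) >= threshold for x in a for y in b):
                a.extend(b)          # fold cluster b into cluster a
                clusters.remove(b)
                merged = True
                break                # restart the scan with the updated cluster list
    return clusters


# Made-up example data: image name -> set of tags.
photos = {
    "img1": {"beach", "sunset", "sea"},
    "img2": {"sea", "beach", "surf"},
    "img3": {"cat", "pet"},
    "img4": {"pet", "dog"},
    "img5": {"mountain", "snow"},
}
groups = cluster_by_tags(photos)
print(groups)
```

This rescans every pair after each merge, so it is only practical for a few hundred items, but it is easy to follow. Raising the threshold gives more, tighter groups; lowering it lets loosely related images chain together.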
