Hanzi Builder

mcfarljw   January 4th, 2010 10:27a.m.

This forum post is a pseudo solution to the suggestion in the previous "Suggested Words" post.

You can read a little more about it on the site, but in short, you copy/paste your Skritter export and it suggests new words based strictly on what you've been studying (no more, no less). You can then copy them to your queue and see whether the words validate (they all come from a larger dictionary than Skritter currently has, so several might not exist yet).

The script behind it is built for another one of my projects, but it seemed to adapt well to Skritter. Hopefully it will allow you to add some new words to your queue that you can learn quickly and painlessly.

http://hanzibuilder.joshmcfarland.net/
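
For readers curious how a tool like this could work, here is a minimal sketch in Python. It is not the script behind Hanzi Builder (which isn't shown in this thread); it only assumes the idea described above: keep dictionary words whose characters have all been studied. The function name `suggest_words` and the sample data are hypothetical.

```python
def suggest_words(known_chars, dictionary, studied_words=()):
    """Return dictionary words built only from characters already studied."""
    known = set(known_chars)
    already = set(studied_words)
    return [
        word
        for word in dictionary
        if len(word) > 1                      # skip single characters
        and word not in already               # skip words already being studied
        and all(ch in known for ch in word)   # every character must be known
    ]


# Hypothetical sample data, for illustration only.
dictionary = ["中国", "国家", "大家", "中文", "大人"]
known_chars = "中国大家"
print(suggest_words(known_chars, dictionary, studied_words=["中国"]))
```

Scanning the dictionary once, as above, keeps the work proportional to the dictionary size rather than to the number of possible character combinations.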

jochemb   January 4th, 2010 10:47a.m.

Ha, nice, I've done the same:

http://huygens.functor.nl/skritter/wordlist/

nick   January 4th, 2010 11:03a.m.

This is great, guys! The first one turned up more words (28,303 vs. 13,500) but took longer and crashed if I put in more than 4500 words/characters. The second one was able to grab the words with more than two characters.

Would love to see the frequency sorting going on here, so we could see which important words were missing instead of just seeing all words.

jochemb   January 4th, 2010 11:36a.m.

I've also added support for maximum word length. My version seems significantly faster at the moment.

Frequency sorting seems a nice feature. I could implement it if somebody points me to a reliable and extensive frequency list.

mcfarljw   January 4th, 2010 11:55a.m.

Yes, nice job, jochemb. I initially set mine up to handle only a length of two, and I wanted to practice working with permutations in PHP. Naturally, they have no practical use once the length is changed to 3 or 4 haha, as it would take a powerhouse machine to process them at any speed.

If I find my frequency list I'll send it to you. Though my home computer is refusing to let me remote in at the moment.
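
To see why the permutation approach stops scaling past length two, here is a rough sketch in Python (the original was PHP and is not shown, so the function names below are hypothetical). It compares brute-force permutation against a plain dictionary scan, and prints how many ordered arrangements exist for a set of 4,500 characters, the input size mentioned earlier in the thread.

```python
import math
from itertools import permutations


def candidates_by_permutation(known_chars, dictionary, length=2):
    """Brute force: build every ordered arrangement, keep those in the dictionary."""
    words = set(dictionary)
    chars = sorted(set(known_chars))
    return sorted(
        "".join(p) for p in permutations(chars, length) if "".join(p) in words
    )


def candidates_by_scan(known_chars, dictionary, length=2):
    """Alternative: walk the dictionary once and test each word's characters."""
    known = set(known_chars)
    return sorted(w for w in set(dictionary) if len(w) == length and set(w) <= known)


# Number of arrangements the brute-force version must generate for 4,500 characters.
for length in (2, 3, 4):
    print(length, math.perm(4500, length))
```

One side effect worth noting: permutations never repeat an element, so a reduplicated word such as 妈妈 is found by the scan but not by the permutation version.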

mcfarljw   January 4th, 2010 1:28p.m.

I just went back and did it the correct way, as opposed to my last crazy experiment haha. It still runs slightly slower, but there were only a few seconds between my two tests. I will blame it on the web hosting service.

jochemb   January 4th, 2010 4:17p.m.

Hey guys, Nick sent me a frequency list and list of all words in Skritter. I have used these to implement two improvements:

- All words returned are actually in the Skritter database.

- The results are sorted in order of decreasing frequency. (So words at the top are the most commonly used according to the frequency list.) Words that are missing from the frequency list are given the lowest possible frequency and therefore end up at the bottom of the results.
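
The two improvements above can be sketched roughly as follows. This is an illustration in Python, not the actual code behind the word list tool; `rank_suggestions`, the sample words, and the frequency counts are all made up for the example.

```python
def rank_suggestions(candidates, skritter_words, frequency):
    """Keep only words in the Skritter list, most frequent first.

    Words missing from the frequency table get a frequency of 0,
    so they sink to the bottom of the results.
    """
    in_database = [w for w in candidates if w in skritter_words]
    return sorted(in_database, key=lambda w: -frequency.get(w, 0))


# Hypothetical sample data, for illustration only.
candidates = ["大家", "国家", "中文", "大人"]
skritter_words = {"大家", "国家", "大人"}
frequency = {"国家": 900, "大家": 500}   # made-up counts

print(rank_suggestions(candidates, skritter_words, frequency))
```

Because Python's sort is stable, words with no frequency entry keep their original relative order at the end of the list.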

Xerxes314   January 5th, 2010 1:20p.m.

Why restrict the list to words that are already in the Skritter database? One of the best features of Skritter is that the database is user-extensible.

jww1066   January 5th, 2010 2:09p.m.

These are awesome, guys. Thanks to you, my queue is now well over 4,000 items... ;)

James

jochemb   January 5th, 2010 3:46p.m.

Xerxes314: I am not entirely sure why I did that. I can revert it if you want, or make it an option.

jww1066   January 5th, 2010 5:09p.m.

@jochemb: I find it more useful if the words are already in Skritter.

James

mike_thatguy   January 6th, 2010 12:31a.m.

@jochemb: Me too -- as long as this master Skritter list gets updated periodically.

jochemb   January 6th, 2010 10:24a.m.

I would have to coordinate that with the Skritter team, if they're willing to send me an up-to-date list every now and then.

nick   January 6th, 2010 11:22a.m.

It would take almost two weeks to grow by 1%, so it shouldn't need to be updated very frequently--but yeah, we can do that.

ndsino   January 7th, 2010 8:14a.m.

This is a useful tool, indeed. Thanks for making it, jochemb. I've got a question: is there a way to take all those new words (it generated over 6000 for me) and then tell it to give me only the words that are also in the HSK?
