July 9, 2008
I've been using my OLPC to read ebooks lately. In particular, I signed up for Tor Book's free weekly sci-fi ebbok giveaway (register here, which I found out about through the Slashdot posting) and have received a whole bunch of books from them.
In the past I've read ebooks on my Handspring Visor and Treo 600 that I've gotten from the Project Gutenberg library. As a side-note, how depressing is it that basically, things before 1923 have had their copyrights expire and are free for public domain use, but everything else is questionable. If the work was published before 1964 and hasn't had its copyright renewed, it is public domain, but anything after 1964 is under copyright until the death of the author plus 90 years. I will be dead before I can legally use what is essentially, the culture in which I grew up. There is a nice list of "recent" science fiction at Project Gutenberg listed on the Thunder Child site.
Also, the Baen Free Library is an excellent source of science fiction - and their Webscriptions Service is great too - I've used it a few times to buy nice DRM free books. I highly recommend Rick Cook's Wizard books for some computer geek fantasy. The first one is free, so you might as well check it out. There are compiled and Lisp jokes mixed in with Dragons (well, my personal feeling is that Lisp is full of Dragons anyway, but...)
So, I'll use this post to review some ebooks that I've read.
Harry Turtledove: The Disunited States of America
I didn't realize this but it turns out that this book is the fourth in the "Crosstime series" - so there are other ones that come before this. I haven't been too interested in much of Turtledove's work since I read one or two of his novels, but generally I like his writing style, but don't think the ideas are too interesting and there usually isn't much in the way of subtlety. I enjoyed this book, but I'm not too fond of stories where the hero is a young teenager (which this book features) although it isn't as bad as the standard Japanese trope of "The fate of the world rests on a moody young teen" (who needs to save the world using a Giant Robot, or Magic Sword, or Henshin powers.)
It was worth reading as a free book, but I'm not interested in checking out the other three books in the series. Still, if you like Turtledove's alternate history style, you will probably like this book. Check it out!
William Poundstone: How would you move Mount Fuji?
The book isn't really about the actual puzzles that pop up in interviews as much as the history and background of the interview style. There is also an interesting chapter about how he thinks interviewing should be done. I'm not a big fan of puzzle questions in the interview since the whole process is very stressful, and I don't think that these questions test how good a person will be for a job as much as how good the person is at solving puzzles. I think you are better off looking at how well someone performs on questions that are relevant to the job, so for programmers and CS area jobs I like computer science / algorithm / design / coding questions. Those can be pretty difficult as well, and still be relevant to the job. If you are relying too much on puzzle questions, you are going to get people who prepare for puzzles, but I don't think the kinds of things that pop up at a job are as well defined and clean as puzzles.
Anyway, this book does have a lot of sample puzzle questions and very good explanation of the solutions to them, and it also has a very interesting overview of interviewing in general. I recommend it!
John Scalzi: Old Man's War
I really enjoyed Old Man's War, and I see that he has about three other books in the same universe. I'm going to order them for sure. Acutally, I have really enjoyed reading this book on my OLPC, the form factor is about right (there is some stuff I would do to make it better, but it is pretty nice) and the OLPC is so versatile (I use it for studying Japanese with Anki and can use it to take notes, look up words in Japanese, etc.) that it is convenient to take with me on an everyday basis. If I will have it with me anyway, it is about as easy to read on it as it is to read a paperback, and I don't have to worry about storing the book later, but I can still keep it indefinitely. I really wish that Tor was selling their catalog like Baen does. I would buy all Scalzi's book right now. Instead I'm going to wait for a little bit and make a big Amazon order I imagine. Alternatively, it would be nice if Amazon sold the Kindle in Japan, but honestly I think I prefer the OLPC if I'm going to lug something around. The OLPC is more open to hacking and is already a very multi-functional device for my needs.
I class Old Man's War up there with Starship Troopers and The Forever War, two excellent books about war in the future. I don't know if it will have the impact of either of those two books, but I think if you liked either of those, you really should read this one too.
July 6, 2008
Asian American comedians on American TV (watched in Asia by an American)I really like the internet. Thanks to the Internet, I'm able to keep up on news in the US, and it helps me feel still connected to a culture that I left behind two and a half years ago. (I also make frequent calls home, but that is more a personal than cultural connection.) I get most of my news from podcasts. I highly recommend WBUR & NPR: On Point with Tom Ashbrook, a nice 45 minute podcast that is only fifteen minutes two short for my commute, NPR's Wait Wait... Don't Tell Me! for a humorous weekly news round-up, This American Life for very interesting stories about the American experience, NPR's Science Friday with Ira Flatow, science!, and Car Talk for funny car-based humor from two ex-MIT guys. There is lots of other good stuff out there, such as the New York-based WNYC Brian Lehrer Show (makes me nostalgic!) and all sorts of other NPR news, but these are the main things that I listen to. Since I'm on the train for a bit over two hours a day, it really helps to have some stuff lined up to listen to. But I also download and watch TV. Lately I've been watching NBC's Last Comic Standing. I've always been a fan of stand up comedy, and liked going to shows when I was in New York. I've been to a few shows here in Tokyo and enjoyed them, but I don't think the American style of stand up comedy is very common here (see my previous post on Japanese comedy and note that Japanmanship is no longer defunct, but is happily re-funct! I vaguely remember Dat Phan in the first Last Comic Standing, and at the time he really came under a lot of fire for being a one-trick pony, only relying on jokes that play on obvious racial stereotypes. I thought he was pretty good, but I don't remember very much about his comedy now, so I won't comment too much on it. In this new version of Last Comic Standing there are two comedians that I really, really want to like, but am having a really hard time with. Esther Ku and Papa CJ. (There are other acts that I don't think are funny, such as God's Pottery, which is basically just a one-trick musical act making fun of the religious right. I don't see how they can have anywhere near the range required to advance in a show like this.) I wanted to like Esther because she's cute and asian. I won't comment about whether Papa CJ is cute, but he's asian too and has a bit of a geeky flavor to him that could be really funny. I think they both should have some really interesting experiences to mine for comedy, but I was really disappointed that Esther seemed to go for the obvious racial based jokes that aren't new and aren't funny. She has a few things that I think are funny, but she's not very subtle, and some of her stuff is just depressingly obvious (asians can't tell asians apart either! koreans eating dogs! were low points for me.) She is cute and bubbly though, and when she stays away from racial stereotype stuff (like when she was just talking about men buying women dinner) she is funny. Papa CJ is interesting, but at least with the way that the show has been cut, he relies too heavily on outsourcing jokes and reincarnation. I guess my problem here is that he used the same material in his two sets, but maybe he's also just trying hard to get through to the next cut and using his best stuff. What bothered me most is that he used a clear set-up of talking directly to an audience member that would have worked great in response to heckling, but just sounded very forced in the way he set it up. Still, both of them made it through to the next show, so hopefully there is more to their humor than I've seen so far. As I expected, there has been a lot of talk on the blog-o-webs about Esther Ku, mostly negative but some positive (kind of) and even an appearance on NPR's Talk of the Nation which I'll listen to pretty soon here. I really hope that Esther is able to bring a bit more subtlety and social commentary into her act - I think Chris Rock does an amazing job of that - and does more non-stereotype based comedy. As long as I'm talking about last comic standing, I want to say that I think that Iliza Shlesinger is really funny, and I hope she goes far. I also really like Paul Foot and Jim Tavare. By the way, the Tokyo Comedy Store's Ken Suzuki was briefly on the show, but they didn't give him much of a chance - which is too bad because he's pretty funny. Anyway, I'm glad that I can at least get some weekly stand-up on TV, which is nice and easy to put on while writing (for example) blog posts.
July 4, 2008
Super cool robot parking systems
July 1, 2008
Books reviews: The Audacity of Hope and The Curious Incident of the Dog in the Night-time
The Audacity of HopeOn the last international flight I took, before getting on the plane I picked up two books. One of them is a kind of current-events sort of deal, Barack Obama's "The Audacity of Hope". I'm really excited about the upcoming US Presidential election. To tell the truth, I can see positives in each of Barack Obama, Hillary Clinton, and John McCain, but I'm most excited about the prospect of Barack Obama becoming president. He made a joke about bubble sorts. He's also been covered a lot on Slashdot, and it really seems like he has a good grasp of some issues important to people in the technology field. I've been intrigued, and have become interested in learning more about his views, so I thought it would be my civic duty to read his book. I'm not really going to give any summaries or big overviews, but I enjoyed the book, and thought it was an easy, fun read. I found his analysis of some of the problems that America faces to be well thought-out and his policies also look very reasonable. There was one chapter on religion and the family that sounded like an appeal to a broad American religious base, but it isn't a fanatical style of mixing religion and government; he clearly believes in the separation of church and state, which is good enough for me. His oft-mentioned points of diversity and hope are also very appealing, so I'm really looking forward to the upcoming election. It is disappointing to me that we only have two major parties and two realistic candidates, but at least that means that if I want to be fair, I only have one other book to read.
The Curious Incident of the Dog in the Night-timeThis is a book that I bought purely on recommendation of a random person on slashdot. It came up in a discussion of science fiction novels (of which I'm a big fan) although it isn't of the space ships and aliens variety. I wouldn't even really classify it as science fiction so much as just fiction written from an interesting perspective. It is really interesting to go into this book without knowing much about it, but I will say that I really enjoyed this book, and highly recommend it. I don't generally like novels that use narrative gimmicks (I can't tell if I'm sick of, or enjoy, the unreliable narrator) but even though this book has its gimmick, it doesn't come off as gimmicky. This is apparently a young adult novel, but I have a thing for well-written young adult novels - particularly Garth Nix's Abhorsen Trilogy and Shade's Children - so that isn't a problem for me. Anyway, I really liked this book, and am going to make space for it on my bookshelf. That's pretty rare because I get the feeling that I won't have too much bookshelf space after the move to the new apartment.
June 30, 2008
Paying the Tax Man, and stuff around townnew Setagaya Museum of Art exhibit on architecture by Osamu Ishiyama.) A few weeks ago I got a big, fat letter from Setagaya county. I knew that they wanted money from me, and specifically money for having the honor of living in their ward. I wasn't in any rush to open it. I wonder if I had just refused to open, like the White House, if it would all just go away. Probably not. So I finally opened it over the weekend, and found out that I had to pay up on Monday. Monday was actually the last day that I could pay the first installment of the tax (although I've paid other installments a bit late, and they have been very kind about it) so I deviated from my standard schedule. Usually I'm out of the house by 7:30am, but today I needed to go to the bank, get some money, go to the post office to mail some stuff, and then go to the ward office to pay the tax bill. I went to my local bank, and was amazed. I have never seen more than two people in that bank ever. There must have been twenty people in line for the ATM machines. I believe that you can register with the ward office to pay the tax bill using the bank system, but I'm not really too sure about that. If I'm going to pay a bunch of money, I like to go down and talk to people, especially because then I can ask questions. That's a lot easier than making sense of tax documents written in Japanese. I still needed to get money out though, so I got hit by the last minute rush. (Nobody was at the actual ward office tax payment division though - that was super quick. Much quicker than waiting for the ATMs.) After pulling out some money, I walked by a local shop that makes tatami mats. I've seen that place once or twice before, but generally they are never open when I am around. I had a camera handy and asked if I could take a picture. It would have been nice to hang around and get some shots of them working on those mats by hand, but I had to get going to get my taxes paid and get to work. It really is pretty amazing though: they do all that weaving and finishing by hand. It looks really tedious. Finally, I also found another interesting candy bar at the local convenience store. Meiji Salty Pralines Chocolate. It is pretty good! I just barely taste the salt, and I really liked the pralines a lot. There are a few more salt and chocolate items that I saw, so maybe I'll pick a few more of them up.
June 28, 2008
Conference HoneymoonAfter L. and I got married, we took a trip to Morocco for our honeymonn. Actually, I had made a committment a long time before to attend the 2008 LREC Conference in Marrakech, Morocco, so we went together as a kind of honeymoon. It was the best honeymoon conference ever. Read more for notes from the trip, and a few pictures. read more (8661 words)
June 21, 2008
Second Apartment ViewingToday I went with L. for the second apartment viewing. On the first viewing we filled out a form with 10 issues we found in the apartment. The majority of them were small bumps in the wall, or a few were areas where there were imperfections in how the finishing on the doors, or how the doors close. They addressed all of the issues, and we pointed out a few more new ones that we found, but generally things are looking good at the apartment.
We were presented with one major change though: in the living room where the pipe from the air conditioner blower unit to the compressor was supposed to be, it turns out that they could not put the compressor where the initial plans indicated they would be. Instead, the compressor had to be moved to the other side of the room, so the exit pipes were changed to the other side. This is a problem because we plan to have custom built shelving put where there is now an exit pipe, and two access panels for the air conditioning unit. In the end the company that does the custom furniture was able to work around the constraints, but it is kind of annoying.
After the second walk-through we went down and spent a while with the custom furniture people working on the wall cabinetry. I'm very happy with it, and excited to see what it will look like when it is done. I can't believe how much it costs, but I'm sure it will be worth it. (For that matter, I can't believe how much the drapes cost either!)
We also had a chance to check out the common facilities at the apartment: the exercise room (looks pretty nice), the two guest rooms (Japanese style and Western style, look very nice), and the library (three study carrels which look very cool - net connections and everything.)
When we finally finished up everything, L., her parents, and myself all went out for an excellent sushi dinner.
June 15, 2008
The NHK Man pays a visitFriday evening I came home from work to a hot apartment: it was raining all day so I had left the windows closed. When I do that my apartment warms up pretty quickly, mostly due to the refrigerator and computer I suppose.
I put some rice on for dinner and opened up the windows but it was still pretty hot. Since I absolutely never have any visitors, I stripped down to my shorts and tried to cool off using the fan.
Sometime around 8pm or so, the doorbell rings. This is very unusual. Nobody ever comes to my place unless I'm expecting a friend or a delivery. Since neither were on the schedule, I assumed it was just some sort of salesperson. I probably should have thrown on a T-shirt and pants, but I figured I could get rid of him fairly quickly, and if he wants to bother me right before dinner he can put up with a shirtless foreigner for the time it takes for me to get rid of him.
When I opened the door, I was surprised that the person there was a NHK representative. I had previously had one visit from the NHK guys. NHK is the Japanese national broadcasting service, like the BBC. They are supported by the government and officially people who have TVs are supposed to pay a licensing fee. I don't have a TV, but I do have a computer monitor that has an internal analog TV tuner, and since I was watching NHK News at the time, I felt like I should pay up. I didn't pay last year when the guy came around, but since then I've been watching a lot of NHK news in the morning and evening (it is good practice) so I really felt like I should pay. So I did. It was about $30 for two months worth of the broadcast fee.
I'm planning to move at the end of next month, so I don't think I'll be paying further licensing fees, but I'm glad I did pay at least once.
Also, some foreigners on the expat blog community have warned of scams where someone pretends to be the NHK guy for collecting licenses so some foreigners are wary of paying. This guy was the real deal though, and I've got some nice official paperwork backing things up.
Still, I wonder what the NHK guy was thinking the whole time, talking to a foreigner in his shorts the whole time. I wonder if I just started some other strange stereotype about Americans for him. :)
June 14, 2008
Saturday morning alarmI was looking forward to getting a bit of extra sleep on Saturday night, but at about 8:45am this morning, there was a pretty big earthquake north of Tokyo (centered on Iwate, a low level 6 on the Japanese scale) which caused shaking down near Tokyo. It was very minor, not strong enough to move anything off my shelves or bookcases, but enough to convince me to get out of bed.
Everything is fine for me, although it was pretty strong farther up north, so I'm watching TV (NHK, I'll have a post about that shortly) to see if there were any injuries or damage caused near the center of the earthquake.
June 11, 2008
Notes from Marriage WeekAs always I am a bit behind posting news to my blog. Sorry about that. If you couldn't tell from my previous entries, I married my girlfriend L. three weeks ago. We didn't have a wedding ceremony, but plan to have one next year in May. My family came to Japan to meet with L.'s family, which was totally great. If you are interested, I've written up the events of the week. read more (3260 words)
June 9, 2008
Busy busy busyI'm back in Japan - got back in just about a week ago. I've been really busy catching up on stuff, and very soon I plan to post an entry about my "wedding week". Next up is a post on the trip to Morocco for a conference / combined Honeymoon. In short: Wedding Week was awesome, got to see my family and my amazingly beautiful wife. The honeymoon was also great, with two "surprise" trips to Casablanca and Milan. More on that later.
For now, I just wanted to say that I am busy, tired, and now sore. I went running this morning with my friend Sebastian, a marathon runner. I haven't run for three months, so I'm out of shape. Horribly, horribly, out of shape.
May 25, 2008
One picture - Traditional Japanese Kimono and Hakama
May 17, 2008
The Future of NewsI do research on automatic opinion identification, and one thing that is really interesting to the community right now is analyzing blog data. Most of the available tagged resources are over newspaper data or movie reviews or other kinds of collectable TEXT - user star rating type things (restaurant reviews, product reviews, etc.)
The community is very interested in moving to blog data, where ostensibly there would be more and varied opinions available to analyze, but there isn't too much data available for that yet. (But see the TREC Blog track work where there is an opinionated blog search task.)
All this interest in blogs and user generated media seems to have had an impact on "traditional" print media. Recently, there was a workshop on the Future of News. It was held at Princeton University, near my old stomping grounds. It would have been nice to go, but thanks to the (news-media destroying!) blogs, I've been able to at least get a brief impression of what was discussed.
Matthew Hurst's great Data Mining blog has two posts with pointers to some summaries from the workshop. Fun stuff.
I'm firmly of the opinion that traditional news media will be around for a long time. Blogs do have some role to play in modern news dissemination, but not a large enough role to displace focused organizations that can fund people to do research and have a vested interest in vetting information. It isn't clear to me that the newswires do as much of this as they should, but the traditional media certainly will play a role in choosing what news to elevate to the national level.
At some point, most blogs are really locally focused, and I don't see how any of the personally-run small sites (like, say, my blog) could ever hope to break interesting news more than once in a lifetime. Also, I like writing about what I ate for dinner. That isn't news. :)
May 15, 2008
Licensed to drive: Making Japanese streets more dangerous
- A valid Driver's License from a reputable country
- A translation of the Driver's License
- At least for me, I needed a certified copy of my driving record from the Texas DMV because the driver's license does not have a date of issue field on it. I also needed a translation of that.
- Your passport
May 11, 2008
Updates around Jiyugaokaother unlikely food combinations that I've come across before. (Although, on that note I did have a Spicy Wiener Doughnut this morning. It sounds like it wouldn't be very good, but it actually was quite nice. The hot dog was great, and a bit spicy, with the same kind of sweet fried exterior that you would find on the misleading Curry Bread that pretend to be sweet jelly-filled doughnuts, but totally are not.) Back on topic with the New York Doughnut Plant, I was really excited to check it out but it is closed! They are remodeling and will be until sometime in September! I'll have moved away from my beloved Jiyugaoka by then! Oh noes! No doughnuts for me. Next up on the docket are the new express trains on the Oimachi line. I often take the Oimachi line from Oyamadai (my beloved small town where I live) to Jiyugaoka (the nearby "big" station) where I can transfer to the more respectable Toyoko line. I take it every day to go to work. The Oimachi line is pretty small, only 10km from end to end. If the weather is nice, I'll just walk. I live very close to the tracks, basically separated only by a narrow road, so I hear the trains go by all the time. One thing that really surprised me is that these new express trains are much less noisy than the local trains. They look pretty cool too. I'm going to miss the noise of trains going by every ten minutes or so. After about two years, it is strange how comforting the noise has become. I know that if I haven't heard trains go by in a while, I really need to get to bed. And they give you a good incentive to wake up when you really should be awake. Finally, one shot of the Cherry Blossoms in Jiyugaoka. I made it through the whole cherry blossom season without posting any photos (I do have a bunch that I could upload, but it seems very clichéd.) I kind of like the cherry blossom trees that line the Jiyugaoka shopping street.
Mabo Tofu: Cheap and easy food
May 9, 2008
More earthquakes in JapanSo I'm a bit late on this -- I meant to write something last night, but never got around to it -- but there was a medium-sized earthquake two nights ago (Wednesday night.) Actually, it was early Thursday morning, at about 1:45am. I was in bed, somewhat asleep, when my dreams started to become strange and wave-like. The entire bed was swaying. There have been a few small earthquakes in the two years that I've been in Tokyo, but this was one of the bigger ones. In my strange dream-like logic, I thought "Should I wake up and run outside?" I convinced myself that the rocking, wavelike motion was kind of pleasant, and somehow thought that I was a on a nice boat.
I feel back deep asleep again shortly.
It was apparently a fairly big earthquake - showing up on the the local blogs.
Checking some of the earthquake report sites it looks like it was a 6.8 on the US scale, and a 4 or 5 on the Japanese scale.
Anyway, things were fine in my place. I think a pen rolled off one of my tables, but that was about it. I'm up on the 4th floor of a 4-story concrete "mansion", and that thing really gets swaying!
I'm a little worried now thinking about "the big one". Benkei is convinced that the big one is coming and will hit Tokyo soon. A friend of mine at work said the same thing. I'm not so sure: I don't really know much about how earthquakes work. Since I'm in the process of buying a place here though, I've decided that the big one is not coming, and our new place will be built to withstand it even if it does hit.
Unbridled optimism is a great power if only you can harness it correctly!
May 6, 2008
How I spent my summer vacationThis week was "Golden Week", a sequence of usually 4 days off at the start of May. This is one of the big holidays in Japan, and people usually have big plans for the break, including overseas travel and the like. The gas prices always seem to go up a bit before Golden Week, and miraculously return to normal a few days afterwards. This year, things are even crazier because of the reinstatement of the lapsed "temporary gas tax" (temporary since 1954! - well, 1974 for the actual temporary part, but there have been gas taxes in effect since 1954.), which are about 50 yen a liter.
I don't really want to write about that - it was reinstated in a recent push by the upper house that overrode the lower house and probably angered a lot of people - but just wanted to quickly jot down the things I've been doing on my "vacation".
I wanted to get some work done over the four day break. Usually the holidays are four weekdays, but timing is particularly bad this year, and two of the holidays fell on the weekend, so we only got Monday and Tuesday off. I brought my work laptop home so I could do some programming, but as often happens high hopes clashed with reality and I didn't get as much done as I wanted.
On Saturday I slept in, and did some needed cleaning around the apartment. I watching some TV, and started looking at my email backlog (that always takes longer than expected.) I have family coming to visit soon and spent some time planning for that, and then in the evening decided that I would take advantage of the holiday and play a game. Quite a while ago I bought Galactic Civilizations 2, a space war and conquest game. It is really cool, and I was totally obsessed with it for a while. Because I'm the obsessive type, I just drop things cold-turkey when I think they are taking up too much of my time. I think by the time I realized what had happened, it was 6am on Sunday morning, and I had wasted about 14 hours in a blink.
So I've put that away again, and won't go back until I can learn to set a timer or something.
I woke up Sunday afternoon, went down to the local cafe for lunch, and read a bit more of "Kafka By the Sea", the Murakami Haruki novel I've been working on for over a year now. I'm about a quarter of the way into the second book and it is really getting interesting. I wish I had more time to devote to reading in Japanese - it is a great way to study, and lots of fun - but it seems hard to take the time. (I say, as I blog.)
Back at home I spent a few hours reading up on the FrameNet project, which I'm using in some research. The framenet book is pretty large, but does not cover the xml file format and all that stuff. Since the data files are pretty large, it is a daunting task to dive in and start using them right away. I'm very surprised that there are not more well-developed interfaces to accessing FrameNet in Java - there are two that are kind of old, and don't look very well adopted, unlike the state of affairs for WordNet, which has many Java interfaces.
On Sunday night I went out and met a friend for dinner, which was lots of fun. I've been cooking at home a lot lately and it is really nice to eat something that I didn't make myself. I have started making lots of Mabo tofu lately, which is pretty good and very easy to make.
Monday I spent some time doing work email, and a bit of programming (getting a feature set developed for a CRF learning system) and in the evening I went to my finaceé's parent's place for dinner. Unfortunately, L. was sick, and holed up in bed. Still, dinner with the in-laws was nice (more food I didn't cook!) and I helped L.'s dad set up his email on a new computer.
Today I've spent the day working with a FrameNet semantic labeler from the NLP group at Lunds University. It looks really great, because it will save me the trouble of having to build a learning system to map from syntactic dependencies to semantic roles. Since that is a pretty tough project in itself, I wouldn't be able to do it justice because really I just want to use a bit of the semantic information in FrameNet for sentiment analysis.
As vacations go, it was nice because I spent maybe two days goofing off, and then a bit of time working but in a more relaxed environment than usual.
Unusual things that happened while walking around Oyamadai: another new restaurant opened up. I'm amazed at how quickly they tear down and construct new buildings around here. For the first time since I've been here, I received a benefit for having signed up for the local supermarket's "rewards card" thing. They gave me a coupon for 500 yen off. Not bad. That's almost a pint of Hagen Daaz ice cream here. (Or two gallons of Blue Bell back in Dallas.)
While I was dozing off in the afternoon, I heard something strange out my window: English! I poked my head out and it looks like another foreigner is moving in nearby. Neat.
April 23, 2008
The Balanced Corpus of Contemporary Written Japanese and Japanese Blog DataToday I went to a brief introduction talk about the plans to release a corpus of Japanese blog data for research use. The presentation was at the National Institute of Informatics, with a panel of Professor Toukura and Professor Oyama from NII, MAEKAWA Kikuo from The National Institute for Japanese Language, and a representative from Yahoo! Japan's blog division (I didn't catch his name, sorry.)
There were a lot of people there, about 30 or so all told. The purpose of the presentation was to introduce the plans to make a corpus of Japanese blog data available for research use. The presentation wasn't too detailed about what exactly will be released, but the current plan is to make the data available to researchers in July of 2008. The data consists of post entries from the Yahoo! Blog service where the users have agreed to allow their data to be collected and used in such a manner. The post comments are not included in the data, and the corpus will possibly have things like proper nouns anonymized and other things done to protect the privacy of the people in the data. It is really nice to see people thinking about putting together this kind of data for research use. I haven't found a URL for the project or I would post that - the contact section of the handout says to email Professors Toukura, Professor Oyama, or Mr. Maekawa, but I suspect there will be information on the main NII homepage about the data release when the time comes.
In addition, Mr. MAEKAWA spoke a bit about the Balanced Corpus of Contemporary Written Japanese, which looks very interesting. The project to build the corpus runs from 2006 to 2010, so they are only about two years into the project right now, but it is looking to be something like a Brown corpus for Japanese. It contains three sub-corpora, published material from 2001~2005 (magazines, newspapers, and books) and material from 1986 - 2005 from library sources (books mostly it looks like), and a mixed domain sub-corpus with web data, white papers, text books, records from Diet meetings, best seller novels, and so on.
This post isn't really all that content bearing, but there was only very useful resource that Mr. MAEKAWA mentioned in his talk: the demo of the KOTONOHA Corpus of Modern Japanese Search system (actual entrance is on a button click from the description page.) This is exactly what Alex was asking about in one of his posts: a Japanese KWIC (Key Word In Context) search.
I don't know how long that demo will be available, but it is totally great for language learners or generally people who don't know colloquial usage. I tried poking around at it a bit, putting in a few terms but didn't come up with anything too interesting. I liked めんど as a search term because there were lots of hits, some showing it used more as めんどう and others the shorter めんど, often with a くさい not too far behind...
Anyway, that demo search could be a useful tool for non-native Japanese speakers. I'll add it to my toolkit of places to check when I'm mystified.
Now if only someone would make a Geinojin info site that would tell me *why* that person is famous and should be a guest on some panel, that would be great. (I currently use Wikipedia for that, but I would be happier with something that just says X: comedian, Y: famous lawyer, etc.)
April 16, 2008
First impressions of OSX 10.5 Leopard and Time CapsuleA while ago I bought a 500GB Time Capsule and Leopard at work to use as a backup solution. It took me a while to find the time, but I had a lot of papers to read recently so I installed Leopard and set up the Time Capsule while reading the papers.
First off, I'm really impressed with Time Capsule. It costs only a bit more than an external hard drive, but had a Gigabit Ethernet 3-port switch and 802.11n wireless. It feels very solid, is small, and is very, very quiet. I have an external IO Data 500GB hard drive right next to the Time Capsule, and it just drowns it out. Even after turning off that drive, I had a hard time hearing the Time Capsule. I'm really shocked at how quiet it is.
Setting the Time Capsule up was really simple, Zero Conf is just great for getting things on a network and making it easy to find them. Since we've got wireless at work I turned off the wireless interface, and just used it to extend the wired connection I already had. Once I set up Leopard on my machine, I started the Time Machine backup, and I have to say again that I am really impressed with how quiet the drive is: I had to listen pretty hard to hear the write noise. I was using ethernet plugged into the Time Capsule for the backup, so I was surprised that I was only getting about 10 MB/sec (sometimes up to 12) to the drive, which surprised me. The Gig-E connection should be able to support 125 MB/sec. Well, not really of course, but 10 MB/sec is an order of magnitude less than I expected!
Interestingly, the 802.11n interface should typically get (according to Wikipedia, so who knows if this is true) about 9.25 MB/sec, or about what I was seeing with the ethernet connection! Wow. Now I want to get one of those Time Capsules at home...
I'm very, very impressed with Time Machine. I haven't played with going back in time for the recovery stuff yet, but it looks like it will be great. I had been using an rsync-based backup solution that would use hard links to not duplicate files that have changed, but there are some problems with that solution. It works very nicely, but every four hours I run the backup script, and for about ten minutes the hard disk thrashes madly as rsync runs down the file tree looking for new files. I had it in a cron job, and nice'd the process so the machine is still totally usable, but there is a noticeable drop in performance, and sometimes you get the dreaded beach-ball while it thinks about file operations.
Time Machine uses a very similar approach, but Apple does some other magic that lets them link directories (not possible with standard Unix tricks as far as I know) and more importantly uses a DBUS-style notification system that tracks file operations, and keeps a list of things that have changed since the last backup. It doesn't have to check the whole file system for changes, it just has a list of things that have changed. The backup is impressively fast. I'm really impressed with how easy it all comes together: this is a consumer solution. There are no real drawbacks: the backup is so fast you wouldn't notice it happening (after the initial full backup) and the drive is so quiet that you can just forget it.
That's all I have to say about that. There are some other things about Leopard that I've noticed that I like:
- The "Spotlight" window is now a real finder window and you can use exposé to find it. In Tiger if you are in finder and do a "show application windows" the Spotlight window does not exist!
- Spotlight is a bit faster now for launching applications, which I use all the time.
- I like the coverflow view for windows much more than I expected. Being able to see large views of PDFs makes it really nice to look for a paper. It also helps recognizing Word and Excel documents quickly much more than I expected it would.
- They fixed the annoying "yellow cursor bug" in the X11 server. Yay!
Here are some annoyances:
- Mail.app again defaults to sending Japanese email in UTF-8. Generally, I think this is a good idea, but for some reason if email isn't in ISO-2022-JP for Japanese a lot of mail clients turn it into gibberish (mojibake). What really surprised me is that this happened with someone on Windows Vista - I don't know what client he uses, but if you are on Vista shouldn't your email client be able to read the headers and use UTF-8? Most mobile phones can only accept ISO-2022-JP, but I would think big computers could deal with it fine. By the way, to set the default encoding for Mail.app, you can enter "defaults write com.apple.mail NSPreferredMailCharset "ISO-2022-JP"" in a Terminal window. I vaguely remember having to do something like this on Tiger as well.
- I don't know if they fixed this, but at one point after a recent Security update in Tiger, all text pasted into Mail.app lost carriage returns. It was awful. There are ways to work around it (paste into Text Edit first, make rich text, paste into Mail.app, then make it plain text again) and so on, but that is annoying. I'm sure I'll notice this pretty quickly because I'm always pasting text into email. But I haven't noticed it yet. (I hope it is fixed.)
Go to Page: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22