[MacRuby-devel] Strings, Encodings and IO

Charles Oliver Nutter charles.nutter at sun.com
Wed Apr 8 10:34:09 PDT 2009

Vincent Isambart wrote:
>> the test_string tests in Ruby 1.9 repository do seem to mostly function
> You mean test/ruby/test_m17n.rb, test/ruby/test_m17n_comb.rb,
> test/ruby/test_io_m17n.rb and test/ruby/enc/test_*.rb?
> test/ruby/test_string.rb does not contain anything m17n related.
>> We simply could not match Ruby regex exactly until we ported the same engines Ruby uses :(
> No two regexp engines have the same behavior, there's nothing anyone
> can do about that...

Well, the problem we ran into is that those behavioral differences 
hindered our ability to run stuff like Rails. We didn't really have a 

> It looks like Oniguruma has support for UTF-16 so I was thinking
> about using that in MacRuby. But as Oniguruma sees everything as a
> list of bytes, I do not know if you could use the Oniguruma UTF-16
> support without modifying your Oniguruma port.

Yes, I have talked with Marcin about us doing a separate fork of "JOni" 
that works with Java's UTF-16 characters directly. I think it could 
become the best Java regexp engine.
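
To illustrate the byte-versus-char distinction discussed above, here is a 
minimal sketch in plain Java. It only uses the standard library; the class 
and method names (Utf16Units, codeUnits, utf16leBytes) are made up for this 
example and are not part of JOni or JRuby. It shows that a Java String is 
already a sequence of UTF-16 code units, while a byte-oriented engine would 
first need the text encoded into a byte[] of twice that length.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only (not JOni or JRuby code).
public class Utf16Units {
    // Number of UTF-16 code units (Java chars) the string holds internally.
    static int codeUnits(String s) {
        return s.length();
    }

    // The bytes a byte-oriented engine would have to scan for the same text.
    // UTF_16LE encodes two bytes per code unit and writes no byte order mark.
    static byte[] utf16leBytes(String s) {
        return s.getBytes(StandardCharsets.UTF_16LE);
    }

    public static void main(String[] args) {
        String s = "h\u00e9llo"; // "héllo"
        System.out.println(codeUnits(s));           // prints 5
        System.out.println(utf16leBytes(s).length); // prints 10
    }
}
```

An engine that matches on char values directly could presumably skip that 
encoding step and the extra copy, which is the appeal of such a fork.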

- Charlie

