[OS X Emacs] Unicode regex

Peter Dyballa Peter_Dyballa at Web.DE
Fri Jan 18 04:35:46 EST 2008


Am 18.01.2008 um 02:20 schrieb Jean-Christophe Helary:

> Is there a syntax definition for the Han range (or any other  
> Unicode plane) in Emacs ?


I don't know. And I can't answer your question since I don't know  
where in Unicode this range is. With some more input I might be able  
to give better answers.

In GNU, Carbon, and Cocoa Emacsen and all other flavours you have the  
chance to *enter* every Unicode character. When it gets displayed as  
a box, then it's still valid, there is just no glyph available from  
some font to display it. The Mac OS X grep obviously is able to  
handle the whole Unicode range, at least the BMP (Basic Multilingual  
Plane). I have no other test cases to test any other plane. The  
mechanism to enter this characters is, for example, C-q <some octal  
code><SPC|RET|cursor>, i.e. you press C-q and then enter the octal  
code of that character. I think this input sequence ends when it  
reaches its natural limit of 177777 for the BMP. If you have to enter  
only a few digits, then you'll need to finish this "mode," which is  
every non-digit. It's also possible to make this mechanism accept  
numbers to the base of 16, so the value form the Unicode notation U 
+ABCD can be used.

--
Greetings

   Pete

With Capitalism man exploits man. With communism it's the exact  
opposite.





More information about the MacOSX-Emacs mailing list