• Top
    • Documentation
    • Books
    • Boolean-reasoning
    • Projects
    • Debugging
    • Std
    • Proof-automation
    • Macro-libraries
    • ACL2
    • Interfacing-tools
    • Hardware-verification
    • Software-verification
      • Kestrel-books
        • Crypto-hdwallet
        • Apt
        • Error-checking
        • Fty-extensions
        • Isar
        • Kestrel-utilities
        • Set
        • Soft
        • C
        • Bv
        • Imp-language
        • Event-macros
        • Java
          • Atj
          • Aij
          • Language
            • Syntax
              • Grammar
              • Unicode-escapes
              • Unicode-input-char
              • Escape-sequence
              • Identifiers
              • Primitive-types
              • Reference-types
              • Keywords
              • Unicode-characters
                • Unicode
                • Ascii
                • String=>unicode
                • Ascii=>string
                • Ascii-list
                • Unicode-list
              • Integer-literals
              • String-literals
              • Octal-digits
              • Hexadecimal-digits
              • Decimal-digits
              • Binary-digits
              • Character-literals
              • Null-literal
              • Floating-point-literals
              • Boolean-literals
              • Package-names
              • Literals
            • Semantics
        • Bitcoin
        • Ethereum
        • Yul
        • Zcash
        • ACL2-programming-language
        • Prime-fields
        • Json
        • Syntheto
        • File-io-light
        • Cryptography
        • Number-theory
        • Lists-light
        • Axe
        • Builtins
        • Solidity
        • Helpers
        • Htclient
        • Typed-lists-light
        • Arithmetic-light
      • X86isa
      • Axe
      • Execloader
    • Math
    • Testing-utilities
  • Syntax

Unicode-characters

Unicode characters in Java [JLS14:3.1].

The Unicode standard distinguishes among `characters', `code points', and `code units'. In Java, characters are essentially Unicode UTF-16 code units, i.e. unsigned 16-bit values. In our formalization, as in [JLS14], we may use the terms `character', `code point', and `code unit' fairly interchangeably, when that causes no confusion.

Subtopics

Unicode
Fixtype of Java Unicode characters.
Ascii
Fixtype of ASCII characters.
String=>unicode
Convert an ACL2 string to a Java Unicode character list.
Ascii=>string
Convert a Java ASCII character list to an ACL2 string.
Ascii-list
Fixtype of lists of ASCII characters.
Unicode-list
Fixtype of lists of Java Unicode characters.