Unicode-characters

Unicode characters in Java [JLS14:3.1].

The Unicode standard distinguishes among `characters', `code points', and `code units'. In Java, a character is essentially a Unicode UTF-16 code unit, i.e. an unsigned 16-bit value. In our formalization, as in [JLS14], we use the terms `character', `code point', and `code unit' fairly interchangeably wherever that causes no confusion.
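The distinction between code points and code units can be observed directly in Java. The following is a minimal sketch and is not part of the formalization; the class name Utf16Units and the helper codeUnits are hypothetical names chosen for illustration. It shows that each Java char is an unsigned 16-bit value and that a code point above U+FFFF occupies two code units.

```java
class Utf16Units {
    // Hypothetical helper: returns the UTF-16 code units of s as ints.
    // A char widens to int without sign extension, so each result
    // lies in the range 0..65535.
    static int[] codeUnits(String s) {
        int[] units = new int[s.length()];
        for (int i = 0; i < s.length(); i++) {
            units[i] = s.charAt(i);
        }
        return units;
    }

    public static void main(String[] args) {
        // U+1F600 lies outside the Basic Multilingual Plane,
        // so UTF-16 encodes it as a surrogate pair.
        String s = new String(Character.toChars(0x1F600));
        System.out.println(s.length());                      // 2 code units
        System.out.println(s.codePointCount(0, s.length())); // 1 code point
        for (int unit : codeUnits(s)) {
            System.out.println(Integer.toHexString(unit));   // d83d, then de00
        }
    }
}
```

For text restricted to ASCII, every character is a single code unit whose value is below 128, which is one situation where the three terms can be used interchangeably without confusion.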

Subtopics

Unicode
Fixtype of Java Unicode characters.
Ascii
Fixtype of ASCII characters.
String=>unicode
Convert an ACL2 string to a Java Unicode character list.
Ascii=>string
Convert a Java ASCII character list to an ACL2 string.
Ascii-list
Fixtype of lists of ASCII characters.
Unicode-list
Fixtype of lists of Java Unicode characters.