admin 管理员组

文章数量: 1184232


2024年1月23日发(作者:大疆开启fcc教程)

Unicode Nearly Plain Text Encoding of Mathematics

Unicode Nearly Plain-Text Encoding of Mathematics

Version 3

Murray Sargent III

Publisher Text Services, Microsoft Corporation

10-Mar-10

1.

Introduction ............................................................................................................ 2

2.

Encoding Simple Math Expressions ...................................................................... 3

2.1

Fractions .......................................................................................................... 4

2.2

Subscripts 6

2.3

Use of the Blank (Space) Character ............................................................... 7

3.

Encoding Other Math Expressions ........................................................................ 8

3.1

Delimiters ........................................................................................................ 8

3.2

Literal Operators ........................................................................................... 10

3.3

Prescripts and Above/Below Scripts ........................................................... 11

3.4

n-ary Operators ............................................................................................. 12

3.5

Mathematical Functions ............................................................................... 13

3.6

Square Roots and Radicals ........................................................................... 13

3.7

Enclosures ..................................................................................................... 14

3.8

Stretchy Characters ....................................................................................... 15

3.9

Matrices ......................................................................................................... 16

3.10

Accent Operators ....................................................................................... 16

3.11

Differential, Exponential, and Imaginary Symbols ................................. 17

3.12

Unicode Subscripts and Superscripts ...................................................... 18

3.13

Concatenation Operators .......................................................................... 18

3.14

Comma, Period, and Colon ........................................................................ 18

3.15

Ordinary Text Inside Math Zones ............................................................. 19

3.16

Space Characters ....................................................................................... 19

3.17

Phantoms and Smashes ............................................................................ 21

3.18

Arbitrary Groupings .................................................................................. 22

3.19

Equation Arrays ......................................................................................... 22

3.20

Math Zones ................................................................................................. 22

3.21

Equation Numbers .................................................................................... 23

3.22

Linear Format Characters and Operands ................................................ 23

3.23

Equation Breaking and Alignment ........................................................... 26

3.24

Size Overrides ............................................................................................ 26

4.

Input Methods ...................................................................................................... 27

4.1

Character Translations ................................................................................. 27

4.2

Math Keyboards ............................................................................................ 29

4.3

Hexadecimal Input ........................................................................................ 29

4.4

Pull-Down Menus, Toolbars, Context Menus .............................................. 29

4.5

Macros ............................................................................................................ 30

4.6

Linear Format Math Autocorrect List .......................................................... 30

4.7

Handwritten Input ........................................................................................ 30

5.

Recognizing Mathematical Expressions ............................................................. 31

Unicode Technical Note 28

1

Unicode Nearly Plain Text Encoding of Mathematics

6.

Using the Linear Format in Programming Languages ....................................... 32

6.1

Advantages of Linear Format in Programs ................................................. 33

6.2

Comparison of Programming Notations ..................................................... 34

6.3

Export to TeX ................................................................................................. 36

7.

Conclusions ........................................................................................................... 37

Acknowledgements ..................................................................................................... 37

Appendix A. Linear Format Grammar ....................................................................... 38

Appendix B. Character Keywords and Properties .................................................... 39

Version Differences ..................................................................................................... 48

References .................................................................................................................... 48

1. Introduction

Getting computers to understand human languages is important in increasing

the utility of computers. Natural-language translation, speech recognition and gen-eration, and programming are typical ways in which such machine comprehension

plays a role. The better this comprehension, the more useful the computer, and

hence there has been considerable current effort devoted to these areas since the

early 1960s. Ironically one truly international human language that tends to be ne-glected in this connection is mathematics itself.

With a few conventions, Unicode1 can encode many mathematical expressions

in readable nearly plain text. Technically this format is a “lightly marked up format”;

hence the use of “nearly”. The format is linear, but it can be displayed in built-up

presentation form. To distinguish the two kinds of formats in this paper, we refer to

the nearly plain-text format as the linear format and to the built-up presentation

format as the built-up format. This linear format can be used with heuristics based

on the Unicode math properties to recognize mathematical expressions without the

aid of explicit math-on/off commands. The recognition is facilitated by Unicode’s

strong support for mathematical symbols.2 Alternatively, the linear format can be

used in “math zones” explicitly controlled by the user either with on-off characters

as used in TeX or with a character format attribute in a rich-text environment. Use of

math zones is desirable, since the recognition heuristics are not infallible.

The linear format is more compact and easy to read than [La]TeX,3,4 or

MathML.5 However unlike those formats, it doesn’t attempt to include all typograph-ical embellishments. Instead we feel it’s useful to handle some embellishments in

the higher-level layer that handles rich text properties like text and background col-ors, font size, footnotes, comments, hyperlinks, etc. In principle one can extend the

notation to include the properties of the higher-level layer, but at the cost of re-duced readability. Hence embedded in a rich-text environment, the linear format

can faithfully represent rich mathematical text, whereas embedded in a plain-text

environment it lacks most rich-text properties and some mathematical typograph-ical properties. The linear format is primarily concerned with presentation, but it

has some semantic features that might seem to be only content oriented, e.g., n-2

Unicode Technical Note 28

Unicode Nearly Plain Text Encoding of Mathematics

aryands and function-apply arguments (see Secs. 3.4 and 3.5). These have been in-cluded to aid in displaying built-up functions with proper typography, but they also

help to interoperate with math-oriented programs.

Most mathematical expressions can be represented unambiguously in the line-ar format, from which they can be exported to [La]TeX, MathML, C++, and symbolic

manipulation programs. The linear format borrows notation from TeX for mathe-matical objects that don’t lend themselves well to a mathematical linear notation,

e.g., for matrices.

A variety of syntax choices can be used for a linear format. The choices made in

this paper favor a number of criteria: efficient input of mathematical formulae, suffi-cient generality to support high-quality mathematical typography, the ability to

round trip elegant mathematical text at least in a rich-text environment, and a for-mat that resembles a real mathematical notation. Obviously compromises between

these goals had to be made.

The linear format is useful for 1) inputting mathematical expressions,6 2) dis-playing mathematics by text engines that cannot display a built-up format, and 3)

computer programs. For more general storage and interchange of math expressions

between math-aware programs, MathML and other higher-level languages are pre-ferred.

Section 2 motivates and illustrates the linear format for math using the fraction,

subscripts, and superscripts along with a discussion of how the ASCII space U+0020

is used to build up one construct at a time. Section 3 summarizes the usage of the

other constructs along with their relative precedences, which are used to simplify

the notation. Section 4 discusses input methods. Section 5 gives ways to recognize

mathematical expressions embedded in ordinary text. Section 6 explains how

Unicode plain text can be helpful in programming languages. Section 7 gives conclu-sions. The appendices present a simplified linear-format grammar and a partial list

of operators.

2. Encoding Simple Math Expressions

Given Unicode’s strong support for mathematics2 relative to ASCII, how much

better can a plain-text encoding of mathematical expressions look using Unicode?

The most well-known ASCII encoding of such expressions is that of TeX, so we use it

for comparison. MathML is more verbose than TeX and some of the comparisons ap-ply to it as well. Notwithstanding TeX’s phenomenal success in the science and engi-neering communities, a casual glance at its representations of mathematical expres-sions reveals that they do not look very much like the expressions they represent.

It’s not easy to make algebraic calculations by hand directly using TeX’s notation.

With Unicode, one can represent mathematical expressions more readably, and the

resulting nearly plain text can often be used with few or no modifications for such

calculations. This capability is considerably enhanced by using the linear format in a

system that can also display and edit the mathematics in built-up form.

Unicode Technical Note 28

3

Unicode Nearly Plain Text Encoding of Mathematics

The present section introduces the linear format with fractions, subscripts, and

superscripts. It concludes with a subsection on how the ASCII space character

U+0020 is used to build up one construct at a time. This is a key idea that makes the

linear format ideal for inputting mathematical formulae. In general where syntax

and semantic choices were made, input convenience was given high priority.

2.1 Fractions

One way to specify a fraction linearly is LaTeX’s frac{numerator}{denominator}.

The

{ } are not printed when the fraction is built up. These simple rules immediately

give a “plain text” that is unambiguous, but looks quite different from the corre-sponding mathematical notation, thereby making it harder to read.

Instead we define a simple operand to consist of all consecutive letters and

decimal digits, i.e., a span of alphanumeric characters, those belonging to the Lx and

Nd General Categories (see The Unicode Standard 5.0,1 Table 4-2. General Category).

As such, a simple numerator or denominator is terminated by most nonalphanumer-ic characters, including, for example, arithmetic operators, the blank (U+0020), and

Unicode characters in the ranges U+2200..U+23FF, U+2500..U+27FF, and U+2900 ..

U+2AFF. The fraction operator is given by the usual solidus / (U+002F). So the sim-ple built-up fraction

本文标签: 开启 大疆 教程 作者

更多相关文章

计算机必须设置默认打印机,电脑系统怎么默认打印机 默认打印机的设置教程...

3月前

打印机是每个办公室的常用设备之一,我们可以通过打印机将电脑中的重要文件打印出来,常见的比如人员表、个人简历、入职资料等等,如果我们想要将电脑系统中的文件打印出来呢?那么我们就

W11下载安装教程

3月前

官方地址   https:www.microsoftzh-cnsoftware-downloadwindows11  这个是官方的地址放心大胆的下载就可以了 下载这个Windows11安装助手 下载之后就可以直接打开文件夹

Realtek驱动重装步骤:超详细版教程

3月前

Realtek 驱动重装实战指南:从失声到完美音频的完整修复路径你有没有遇到过这样的情况——电脑突然没声音了,重启无效、音量正常、喇叭也没坏;或者插上耳机系统毫无反应&a

Multisim14.3安装教程:Win10Win11兼容性配置指南

3月前

如何在 Win10Win11 上成功安装 Multisim 14.3?绕过兼容性陷阱的实战指南你有没有试过在新电脑上安装 Multisim 14.3,结果刚点开 setup.exe 就被系统“拒

小白必学教程:解决Chrome浏览器的网页加载难题

2月前

目录前言 折磨了我近一个月的Chrome打开网页慢的问题终于在今天告一段落,完结撒花~然后觉得很有必要发一篇文章来指引误入歧途的后来人~ 互联网上大部分教程无非都是教你在浏览器设置里取消勾选一些选项,在局

笔记本键盘的秘密武器:fn键与f键的完美联动,你get了吗?

2月前

1、现在很多电脑的F1—F12快捷键都是音量、屏幕亮度、截图工具的调节功能。如果要使用的是原本的帮助键就要用Fn+F1—F12,在使用过程中可能会带来不便。 2、笔记本的键盘上也有锁定Fn键的快捷键,就是ESC键+Fn键,开启

MSTSC教程:快速学会远程控制他人PC的技巧

2月前

常用的:1. mstsc v: 192.168.0.1 连接数满了之后使用的:2. mstsc v: 192.168.0.1 console -admin Mstsc 命

新手必学教程:Win7中如何优雅地隐藏UAC警告

2月前

UAC是微软为 提高系统安全而在Windows Vista中引入的新技术,它要求所有用户在标准账号模式下运行程序和任务,阻止未认证的程序安装,并阻止或提示标准用户进行不当的系统设置改变。既然win7中uac用处多多,为何要关闭

崩溃模式开启!教你快速修复笔记本电脑键盘故障的神技能!

2月前

作为一个铁打的程序员,如果突然键盘用不了,那无异于打断了我们的双手,脑海中一大堆的代码想要溢出却无法打出来,简直无法想象!今天晚上由于手贱下载了个VMware的清理软件,不小心把键盘驱动的注册表信息也删除了,然后就各种GG,瞎紧张了半

笔记本与Wi-Fi的不解之缘:如何完美搭建无线连接环境?

2月前

好多同学买了笔记本后,会想到开个wifi,然后就可以用手机连接上网了,想想都有点小激动。。。 最简单的就是,去网上搜个360wifi,然后一键开启,随意更改,还方便管理(PS:不是打广告 但是:

告别手动操作!用批处理让笔记本自动连接WIFI

2月前

先上个成品图 成品就长这个样,不用工具纯手敲命令,每次都要写很长,看得懂些批处理,自己拼凑了一个小工具。 网络环境需要把在用的网卡设置成共享模式,开放的WIFI才有网。@echo offtitle 王三三

VMware workstation 12新手安装教程:快速解锁虚拟机操作技巧。

2月前

哈喽,大家好。今天一起学习的是VMware Workstation 12的安装,vm虚拟机是小编非常喜欢的生产力软件,小编之前发布的测试教程钧在vm上进行的实验。 VMware Workstation是一款功能强大的桌面虚拟计

VMware Workstation 12快速部署指南:一键安装不求人!

2月前

哈喽,大家好。今天一起学习的是VMware Workstation 12的安装,vm虚拟机是小编非常喜欢的生产力软件,小编之前发布的测试教程钧在vm上进行的实验。 VMware Workstation是一款功能强大的桌面虚拟计

Ubuntu教程:立刻学会快速显隐桌面的简便方法

2月前

在Ubuntu中输入文本的时候,发现无法输入字母d,每次按下字母d,就会回到桌面。按了很多次,然后恍然大悟,这是显示桌面的快捷键。 但这样的操作时影响正常使用的。需要改一下,标准的显示桌面快捷键是 Ctrl + Alt + d

手机做电脑的网络桥接器,为什么老是没反应?解答在这里!

2月前

000首先排除 一个原因是驱动签名原因, 手机驱动没有去做微软认证所以导致认证识别不给加载。 所以要禁用驱动签名:搜索栏搜索->高级启动->疑难解答->高级选项->高级启动->重启->

一步到位:轻松掌握微信视频号在电脑上发布链接的实用方法!

2月前

一、结果 复制到微信发送并打开可以直接跳转到视频号的视频上 二、教程1 安装Charles抓包 Charles安装、配置详情请看 首先打开Charles并配置好    点击扫把清空  方便等下找

Windows激活密钥教程:从新手到高手

2月前

win+R输入红色: 7|Ultimate|Acer| slmgr.vbs -ipk FJGCP-4DFJD-GJY49-VJBQ7-HYRR2 7|Ultimate|Dell| slmgr

铭瑄主板VT功能开启步骤详解:打造高效计算环境不是梦

2月前

铭瑄主板开启虚拟化技术(VT)详细教程在当今的计算机应用中,虚拟化技术(VT)已经成为不可或缺的一部分。无论是开发测试、系统部署,还是学习研究,虚拟机都为我们提供了极大的便利。而要在计算机上运行虚拟机,首先需要确保主板的

XMP内存技术实操指南:提升电脑性能不再难

2月前

内存是计算机与 CPU 进行沟通的桥梁,它是一台电脑不可或缺的硬件之一,重要性至关重要。而我们在选购内存的时候,会看到在一些内存参数中注明支持 X.M.P,那么电脑内存 XMP 是什么意思? 内存 XMP 是什么意思

XMP内存技术大揭秘:提升你的游戏和日常使用效率,从此告别卡顿!

2月前

内存是计算机与 CPU 进行沟通的桥梁,它是一台电脑不可或缺的硬件之一,重要性至关重要。而我们在选购内存的时候,会看到在一些内存参数中注明支持 X.M.P,那么电脑内存 XMP 是什么意思? 内存 XMP 是什么意思

发表评论

全部评论 0
暂无评论