In computing and telecommunications, a character is the encoded representation of a natural language character (including letter, numeral and punctuation), whitespace (space or tab), or a control character (controls computer hardware that consumes character-based data). A sequence of characters is called a string.

Figure: A string of seven characters. The word "example" is shown with each letter in a separate box; the label "String" refers to the whole word and the label "Character" points to an individual box.

Some character encoding systems represent each character using a fixed number of bits, whereas others use varying sizes. Various fixed-length sizes were used for now-obsolete systems such as the six-bit character code,[1][2] the five-bit Baudot code and even 4-bit systems (with only 16 possible values).[3] The more modern ASCII system defines 7-bit codes, which are usually stored one per 8-bit byte. Today, the Unicode-based UTF-8 encoding uses a varying number of byte-sized code units to encode each code point, and one or more code points combine to encode a character.

Terminology

Character

In general, a character is a symbol (such as a letter or number) that represents information, and in the context of computing it is a representation of such a symbol that may be accepted by a computer.[4] A character implies an encoding of information, often as defined by a standard such as ANSI or Unicode.

Character set

A character set identifies a repertoire of characters that are each mapped to a unique numeric value.
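As a minimal sketch of this idea, the Python snippet below builds a toy character set as a two-way mapping between characters and numbers. The repertoire and the numbering (`TOY_CHARSET`) are invented purely for illustration and do not correspond to any real standard; the last line uses Python's built-in `ord`, which returns the Unicode code point of a character.

```python
# A toy "character set": every character in the repertoire maps to one
# unique number. The repertoire and numbering are made up for illustration.
TOY_CHARSET = {"A": 0, "B": 1, "C": 2, " ": 3}
TOY_DECODE = {code: char for char, code in TOY_CHARSET.items()}


def to_codes(text):
    """Map each character of text to its numeric value in the toy set."""
    return [TOY_CHARSET[ch] for ch in text]


def from_codes(codes):
    """Map numeric values back to the characters they identify."""
    return "".join(TOY_DECODE[code] for code in codes)


# Python's built-in ord() does the same lookup for the Unicode character set.
unicode_codes = [ord(ch) for ch in "ABC"]
```

Because each character has exactly one number and each number exactly one character, `from_codes(to_codes(text))` returns the original text for any text drawn from the repertoire.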

Glyph

Glyph describes a particular visual appearance of a character. Many computer fonts consist of glyphs that are indexed by the numerical code of the corresponding character.

With the advent and widespread acceptance of Unicode[5] and bit-agnostic coded character sets, a character is increasingly being seen as a unit of information, independent of any particular visual manifestation. The ISO/IEC 10646 (Unicode) International Standard defines character, or abstract character, as "a member of a set of elements used for the organization, control, or representation of data". Unicode's definition supplements this with explanatory notes that encourage the reader to differentiate between characters, graphemes, and glyphs, among other things. Such differentiation is an instance of the wider theme of the separation of presentation and content.

For example, the Hebrew letter aleph ("א") is often used by mathematicians to denote certain kinds of infinity (ℵ), but it is also used in ordinary Hebrew text. In Unicode, these two uses are considered different characters, and have two different Unicode numerical identifiers ("code points"), though they may be rendered identically. Conversely, the Chinese logogram for water ("水") may have a slightly different appearance in Japanese texts than it does in Chinese texts, and local typefaces may reflect this. Nonetheless, in Unicode they are considered the same character, and share the same code point.
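The distinction can be checked with Python's standard `unicodedata` module. The sketch below is only illustrative: the helper `describe` is a hypothetical name, and the characters are written as escape sequences so that the code points are explicit.

```python
import unicodedata

hebrew_alef = "\u05d0"  # the letter as used in ordinary Hebrew text
alef_symbol = "\u2135"  # the symbol used in mathematics
water = "\u6c34"        # the logogram for water, shared by Chinese and Japanese


def describe(ch):
    """Return the code point and the Unicode name of a single character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


for ch in (hebrew_alef, alef_symbol, water):
    print(describe(ch))
```

The two aleph characters report different code points and names even when a font draws them alike, while the water logogram has a single code point regardless of which regional typeface renders it.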

The Unicode standard differentiates between these abstract characters and coded characters or encoded characters that have been paired with numeric codes that facilitate their representation in computers.

Combining character

Unicode supports combining characters by allocating a code point to each of:

  • 'i' (U+0069),
  • the combining diaeresis (U+0308), and
  • 'ï' (U+00EF).

This makes it possible to code the middle character of the word 'naïve' either as a single character 'ï' or as a combination of the character 'i' with the combining diaeresis (U+0069 LATIN SMALL LETTER I + U+0308 COMBINING DIAERESIS); this is also rendered as 'ï'.
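The two codings of this word can be compared in Python. This is a small sketch using the standard `unicodedata.normalize` function; the variable names are illustrative only. NFC composes a base letter and combining mark into the single precomposed code point where one exists, and NFD does the reverse.

```python
import unicodedata

precomposed = "na\u00efve"  # middle character coded as U+00EF
decomposed = "nai\u0308ve"  # middle character coded as U+0069 + U+0308

# The two strings usually look the same on screen but differ as sequences
# of code points, so a plain comparison treats them as different.
same_without_normalizing = precomposed == decomposed

# Normalizing both strings to one form makes them comparable.
nfc = unicodedata.normalize("NFC", decomposed)   # compose
nfd = unicodedata.normalize("NFD", precomposed)  # decompose

print(len(precomposed), len(decomposed))         # code point counts
print(nfc == precomposed, nfd == decomposed)
```

Text that may arrive in either form is therefore commonly normalized before it is compared or used as a lookup key.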

char

In C, char (short for character) is a data type with a size of one byte,[6][7] but unlike the de facto size of a byte as 8 bits, this use of byte is less specific. A byte is defined to be large enough to contain any member of the "basic execution character set". The number of bits used by a compiler is accessible via the CHAR_BIT macro. By far the most common size is 8 bits, and POSIX requires it to be 8 bits.[8] In modern C standards, char is required to hold UTF-8 code units,[6][7] which requires a minimum size of 8 bits.

Since a Unicode code point may require as many as 21 bits,[9] the char type is generally not large enough for every character. Nonetheless, the char type is well-suited for the UTF-8 encoding, where each code point requires 1 to 4 bytes.
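The sizes mentioned here can be observed directly. The following Python sketch (Python is used only for illustration; the sample characters and the helper name `utf8_len` are arbitrary choices) counts the UTF-8 bytes needed for code points of increasing magnitude and the number of bits in the largest code point, U+10FFFF.

```python
def utf8_len(ch):
    """Number of bytes (8-bit code units) UTF-8 uses for one code point."""
    return len(ch.encode("utf-8"))


samples = {
    "A": "\u0041",           # Latin capital letter A
    "i-diaeresis": "\u00ef",  # Latin small letter i with diaeresis
    "water": "\u6c34",       # CJK ideograph
    "emoji": "\U0001F600",   # a code point outside the 16-bit range
}

byte_counts = {name: utf8_len(ch) for name, ch in samples.items()}
max_code_point_bits = (0x10FFFF).bit_length()

print(byte_counts)
print(max_code_point_bits)
```

Each of these values is a single code point, yet only the first fits in one 8-bit code unit, which is why a single char cannot hold an arbitrary character.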

The fact that a character was historically stored in a single byte has led to the terms "char" and "character" being used interchangeably, which causes confusion today when multibyte encodings such as UTF-8 are used. Modern POSIX documentation attempts to fix this by defining "character" as a sequence of one or more bytes representing a single graphic symbol or control code, and uses "byte" when referring to char data.[10][11] However, it still contains errors such as defining an array of char as a character array (rather than a byte array).[12]

Unicode can be stored in strings of code units that are larger than char, called wide characters. The original C type was called wchar_t. Because some platforms define wchar_t as 16 bits and others define it as 32 bits, current versions of the language provide the unambiguous char16_t and char32_t. Even then, the objects being stored might not be characters; for instance, the variable-length UTF-16 is often stored in arrays of char16_t.
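To see why a 16-bit unit is not always a whole character, the sketch below splits text into UTF-16 code units in Python. It stands in for what an array of char16_t would contain; the helper name `utf16_units` is hypothetical, and little-endian byte order is an arbitrary choice made for the example.

```python
def utf16_units(text):
    """Return the 16-bit code units of text when encoded as UTF-16."""
    data = text.encode("utf-16-le")
    return [int.from_bytes(data[i:i + 2], "little") for i in range(0, len(data), 2)]


bmp_units = utf16_units("\u6c34")        # fits in one 16-bit unit
astral_units = utf16_units("\U0001F600")  # needs a surrogate pair

# In UTF-32 every code point occupies exactly one 32-bit unit (4 bytes).
utf32_bytes = len("\U0001F600".encode("utf-32-le"))

print([hex(u) for u in bmp_units])
print([hex(u) for u in astral_units])
print(utf32_bytes)
```

A code point above U+FFFF occupies two 16-bit units, so indexing such an array by unit does not always land on a character boundary.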

Other languages also have a char type. Many, including C++, use 8-bit bytes like C.[7] Others, such as Java, use 2-byte, wide storage to more directly accommodate UTF-16.

References

  1. ^ Dreyfus, Phillippe (1958). "System design of the Gamma 60". Managing Requirements Knowledge, International Workshop on, Los Angeles. New York. pp. 130–133. doi:10.1109/AFIPS.1958.32. […] Internal data code is used: Quantitative (numerical) data are coded in a 4-bit decimal code; qualitative (alpha-numerical) data are coded in a 6-bit alphanumerical code. The internal instruction code means that the instructions are coded in straight binary code.
    As to the internal information length, the information quantum is called a "catena," and it is composed of 24 bits representing either 6 decimal digits, or 4 alphanumerical characters. This quantum must contain a multiple of 4 and 6 bits to represent a whole number of decimal or alphanumeric characters. Twenty-four bits was found to be a good compromise between the minimum 12 bits, which would lead to a too-low transfer flow from a parallel readout core memory, and 36 bits or more, which was judged as too large an information quantum. The catena is to be considered as the equivalent of a character in variable word length machines, but it cannot be called so, as it may contain several characters. It is transferred in series to and from the main memory.
    Not wanting to call a "quantum" a word, or a set of characters a letter, (a word is a word, and a quantum is something else), a new word was made, and it was called a "catena." It is an English word and exists in Webster's although it does not in French. Webster's definition of the word catena is, "a connected series;" therefore, a 24-bit information item. The word catena will be used hereafter.
    The internal code, therefore, has been defined. Now what are the external data codes? These depend primarily upon the information handling device involved. The Gamma 60 is designed to handle information relevant to any binary coded structure. Thus an 80-column punched card is considered as a 960-bit information item; 12 rows multiplied by 80 columns equals 960 possible punches; is stored as an exact image in 960 magnetic cores of the main memory with 2 card columns occupying one catena. […]
  2. ^ Blaauw, Gerrit Anne; Brooks Jr., Frederick Phillips; Buchholz, Werner (1962), "4: Natural Data Units" (PDF), in Buchholz, Werner (ed.), Planning a Computer System – Project Stretch, McGraw-Hill Book Company, Inc. / The Maple Press Company, York, PA., pp. 39–40, LCCN 61-10466, archived (PDF) from the original on 2025-08-05, retrieved 2025-08-05, […] Terms used here to describe the structure imposed by the machine design, in addition to bit, are listed below.
    Byte denotes a group of bits used to encode a character, or the number of bits transmitted in parallel to and from input-output units. A term other than character is used here because a given character may be represented in different applications by more than one code, and different codes may use different numbers of bits (i.e., different byte sizes). In input-output transmission the grouping of bits may be completely arbitrary and have no relation to actual characters. (The term is coined from bite, but respelled to avoid accidental mutation to bit.)
    A word consists of the number of data bits transmitted in parallel from or to memory in one memory cycle. Word size is thus defined as a structural property of the memory. (The term catena was coined for this purpose by the designers of the Bull GAMMA 60 computer.)
    Block refers to the number of words transmitted to or from an input-output unit in response to a single input-output instruction. Block size is a structural property of an input-output unit; it may have been fixed by the design or left to be varied by the program. […]
  3. ^ "Terms And Abbreviations". MCS-4 Assembly Language Programming Manual - The INTELLEC 4 Microcomputer System Programming Manual (PDF) (Preliminary ed.). Santa Clara, California, US: Intel Corporation. December 1973. pp. v, 2-6. MCS-030-1273-1. Archived (PDF) from the original on 2025-08-05. Retrieved 2025-08-05. […] Bit - The smallest unit of information which can be represented. (A bit may be in one of two states I 0 or 1). […] Byte - A group of 8 contiguous bits occupying a single memory location. […] Character - A group of 4 contiguous bits of data. […] (NB. This Intel 4004 manual uses the term character referring to 4-bit rather than 8-bit data entities. Intel switched to use the more common term nibble for 4-bit entities in their documentation for the succeeding processor 4040 in 1974 already.)
  4. ^ "Definition of CHARACTER". Merriam-Webster. Retrieved 2025-08-05.
  5. ^ Davis, Mark (2025-08-05). "Moving to Unicode 5.1". Google Blog. Retrieved 2025-08-05.
  6. ^ a b "§5.2.4.2.1 Sizes of integer types <limits.h> / §6.2.5 Types / §6.5.3.4 The sizeof and _Alignof operators". ISO/IEC 9899:2018 - Information technology -- Programming languages -- C.
  7. ^ a b c "§1.7 The C++ memory model / §5.3.3 Sizeof". ISO/IEC 14882:2011.
  8. ^ "<limits.h>". pubs.opengroup.org. Retrieved 2025-08-05.
  9. ^ "Glossary of Unicode Terms – Code Point". Retrieved 2025-08-05.
  10. ^ "POSIX definition of Character".
  11. ^ "POSIX strlen reference".
  12. ^ "POSIX definition of Character Array".