如何将一句中文转换成unicode码？？在线结贴

eEric 2003-10-17 04:25:18

如何将一句中文转换成unicode码？？

...全文

317 6 打赏收藏转发到动态举报

写回复

用AI写文章

6 条回复

切换为时间正序

请发表友善的回复…

发表回复

eEric 2003-10-17

打赏
举报

那如何得到汉字‘丰’的unicode码4E30？？

goodboyws 2003-10-17

打赏
举报

cout<<outmsg<<endl;改成wcout<<outmsg<<endl

eEric 2003-10-17

打赏
举报

我的代码如下，为什么输出会有误呢？

char* str="t";
WCHAR outmsg[256];
MultiByteToWideChar(CP_ACP,0,str,strlen(str)+1,outmsg,sizeof(outmsg)
/sizeof(outmsg[0]) );
cout<<outmsg<<endl;

//输出为0012FD68

gzshd 2003-10-17

打赏
举报

csdn:

MultiByteToWideChar
The MultiByteToWideChar function maps a character string to a wide-character (Unicode) string. The character string mapped by this function is not necessarily from a multibyte character set.

int MultiByteToWideChar(
UINT CodePage, // code page
DWORD dwFlags, // character-type options
LPCSTR lpMultiByteStr, // address of string to map
int cchMultiByte, // number of bytes in string
LPWSTR lpWideCharStr, // address of wide-character buffer
int cchWideChar // size of buffer
);

Parameters
CodePage
Specifies the code page to be used to perform the conversion. This parameter can be given the value of any code page that is installed or available in the system. You can also specify one of the following values: Value Meaning
CP_ACP ANSI code page
CP_MACCP Macintosh code page
CP_OEMCP OEM code page
CP_SYMBOL Symbol code page (42)
CP_THREAD_ACP The current thread's ANSI code page
CP_UTF7 Translate using UTF-7
CP_UTF8 Translate using UTF-8

dwFlags
A set of bit flags that indicate whether to translate to precomposed or composite wide characters (if a composite form exists), whether to use glyph characters in place of control characters, and how to deal with invalid characters. You can specify a combination of the following flag constants: Value Meaning
MB_PRECOMPOSED Always use precomposed characters — that is, characters in which a base character and a nonspacing character have a single character value. This is the default translation option. Cannot be used with MB_COMPOSITE.
MB_COMPOSITE Always use composite characters — that is, characters in which a base character and a nonspacing character have different character values. Cannot be used with MB_PRECOMPOSED.
MB_ERR_INVALID_CHARS If the function encounters an invalid input character, it fails and GetLastError returns ERROR_NO_UNICODE_TRANSLATION.
MB_USEGLYPHCHARS Use glyph characters instead of control characters.

A composite character consists of a base character and a nonspacing character, each having different character values. A precomposed character has a single character value for a base/non-spacing character combination. In the character è, the e is the base character and the accent grave mark is the nonspacing character.

The function's default behavior is to translate to the precomposed form. If a precomposed form does not exist, the function attempts to translate to a composite form.

The flags MB_PRECOMPOSED and MB_COMPOSITE are mutually exclusive. The MB_USEGLYPHCHARS flag and the MB_ERR_INVALID_CHARS can be set regardless of the state of the other flags.

lpMultiByteStr
Points to the character string to be converted.
cchMultiByte
Specifies the size in bytes of the string pointed to by the lpMultiByteStr parameter. If this value is –1, the string is assumed to be null terminated and the length is calculated automatically.
lpWideCharStr
Points to a buffer that receives the translated string.
cchWideChar
Specifies the size, in wide characters, of the buffer pointed to by the lpWideCharStr parameter. If this value is zero, the function returns the required buffer size, in wide characters, and makes no use of the lpWideCharStr buffer.
Return Values
If the function succeeds, and cchWideChar is nonzero, the return value is the number of wide characters written to the buffer pointed to by lpWideCharStr.

If the function succeeds, and cchWideChar is zero, the return value is the required size, in wide characters, for a buffer that can receive the translated string.

If the function fails, the return value is zero. To get extended error information, call GetLastError. GetLastError may return one of the following error codes:

ERROR_INSUFFICIENT_BUFFER
ERROR_INVALID_FLAGS
ERROR_INVALID_PARAMETER
ERROR_NO_UNICODE_TRANSLATION

Remarks
The lpMultiByteStr and lpWideCharStr pointers must not be the same. If they are the same, the function fails, and GetLastError returns the value ERROR_INVALID_PARAMETER.

The function fails if MB_ERR_INVALID_CHARS is set and it encounters an invalid character in the source string. An invalid character is one that would translate to the default character if MB_ERR_INVALID_CHARS was not set, but is not the default character in the source string, or when a lead byte is found in a string and there is no valid trail byte for DBCS strings. When an invalid character is found, and MB_ERR_INVALID_CHARS is set, the function returns 0 and sets GetLastError with the error ERROR_NO_UNICODE_TRANSLATION.

Windows CE: Windows CE does not support the CP_UTF7 and CP_UTF8 values for the CodePage parameter.

QuickInfo
Windows NT: Requires version 3.1 or later.
Windows: Requires Windows 95 or later.
Windows CE: Requires version 1.0 or later.
Header: Declared in winnls.h.
Import Library: Use kernel32.lib.

eEric 2003-10-17

打赏
举报

能举一个例子吗， akiko(弥弥) ？？

akiko 2003-10-17

打赏
举报

MultiByteToWideChar

一、ASCII、机内码、区位码、国标码、Unicode码他们之间是如何转换的，方程式是什么汉字系统中的过程包括区位码、国标码和机内码，其中的转换关系如下： 1.区位码（十进制）转换成区位码（十六进制）。这里要把前两个位换成十六进制，然后后两位换成十六进制。例如，某汉字的区位码是5448，这样把54转换为16进制数36；再把位码48转换为16进制数为30，得到十六进制数3630。 2.国

用java如何把unicode码转成汉字？会有那么一天我和你躺在同一张大床上，醒来第一眼看见的就是你。用java怎么把“\u4fdd”码转成汉字？程序怎么写的？干柴遇烈火，那叫明骚，湿柴遇火苗，那才是闷骚java中将unicode码转换成汉字的方式是直接使用string类型，打印即可： String ascii="\u4f01\u4e1a";//这两个unicode码就是企业的System.ou...

如果要将码点转换为 Unicode 字符串，可以使用 Python 内置的 chr() 函数。注意，由于 Unicode 码点可能需要使用多个字节进行表示，因此在 Python 中，chr() 函数的参数应该是一个整数，而不是一个字节或字符。如果要将 Unicode 字符串转换为其对应的码点（即字符在 Unicode 中的编号），可以使用 Python 内置的 ord() 函数。Unicode 是一种用于表示世界上所有字符的编码标准，它包括 ASCII 码表中的字符以及其他语言、符号、表情等等。

一、ASCII、机内码、区位码、国标码、Unicode码他们之间是如何转换的，方程式是什么汉字系统中的过程包括区位码、国标码和机内码，其中的转换关系如下： 1.区位码（十进制）转换成区位码（十六进制）。这里要把前两个位换成十六进制，然后后两位换成十六进制。例如，某汉字的

参考博客： Unicode与UTF-8互转(C语言实现)：http://blog.csdn.net/tge7618291/article/details/7599902 汉字 Unicode 编码范围：http://www.qqxiuzi.cn/zh/hanzi-unicode-bianma.php GBK 编码：http://www.qqxiuzi.cn/zh/hanzi-GBK-bian...

VC/MFC

16,471

社区成员

421,732

社区内容

发帖

与我相关

我的任务

社区管理员

加入社区

近7日
近30日
至今

加载中

查看更多榜单

社区公告

VC/MFC社区版块或许是CSDN最“古老”的版块了，记忆之中，与CSDN的年龄几乎差不多。随着时间的推移，MFC技术渐渐的偏离了开发主流，若干年之后的今天，当我们面对着微软的这个经典之笔，内心充满着敬意，那些曾经的记忆，可以说代表着二十年前曾经的辉煌……
向经典致敬，或许是老一代程序员内心里面难以释怀的感受。互联网大行其道的今天，我们期待着MFC技术能够恢复其曾经的辉煌，或许这个期待会永远成为一种“梦想”，或许一切皆有可能……
我们希望这个版块可以很好的适配Web时代，期待更好的互联网技术能够使得MFC技术框架得以重现活力，……

试试用AI创作助手写篇文章吧

+ 用AI写文章