{"_id":"569f86e697472b17002015fc","__v":0,"user":"55116f88e2990b0d00fb0552","version":{"_id":"568fce2b04440a1700e4cb4a","project":"568fce2a04440a1700e4cb47","__v":20,"createdAt":"2016-01-08T14:56:43.101Z","releaseDate":"2016-01-08T14:56:43.101Z","categories":["568fce2b04440a1700e4cb4b","568fd1b8b700ce0d002f4b1c","568fd23804440a1700e4cb5b","568fd2444719c119002ce5d8","568ff21204440a1700e4cbc1","5693732c8aa8040d009f2c28","5693738393445b0d00abdad0","5693740093445b0d00abdad1","56937445974aaa0d001ca699","5693b82173f48f0d0075c90d","5694c4cd1005590d0062cb25","569f854466a5640d00efa54c","56a264cdd15dd70d008d825b","56aa56bf318e6c1700a19ddb","56b0e6347ae4550d000627bd","56b200c0f48f270d00e0de6f","56b200c6f48f270d00e0de70","56b22a9665ddf50d0076ba40","56e92ef71996862200fd7f42","574d6577fb835c0e00ca316a"],"is_deprecated":false,"is_hidden":false,"is_beta":false,"is_stable":true,"codename":"","version_clean":"1.0.0","version":"1.0"},"project":"568fce2a04440a1700e4cb47","parentDoc":null,"category":{"_id":"569f854466a5640d00efa54c","pages":["569f85971082520d005c7027","569f86e697472b17002015fc","569fb5d2beb79a17009f8f7c"],"project":"568fce2a04440a1700e4cb47","__v":3,"version":"568fce2b04440a1700e4cb4a","sync":{"url":"","isSync":false},"reference":false,"createdAt":"2016-01-20T13:01:56.754Z","from_sync":false,"order":10,"slug":"reference","title":"Reference"},"updates":[],"next":{"pages":[],"description":""},"createdAt":"2016-01-20T13:08:54.811Z","link_external":false,"link_url":"","githubsync":"","sync_unique":"","hidden":false,"api":{"settings":"","results":{"codes":[]},"auth":"required","params":[],"url":""},"isReference":false,"order":1,"body":"Most text messages will use the more space efficient 7 bit GSM character set as it covers most Latin based language characters. The GSM character set was extended to cover a few more commonly used characters and these 10 characters are known as the Extended GSM characters and include:\n\n**€ <FF> [ \\ ] ^ { | } ~**\n\nCharacters included in the Extended GSM character set will use 2 characters (7 bit GSM) instead of the usual 1 as they require an escape character prefix.\n\nThe GSM 7-bit character set is supported default by all GSM handsets and network elements, but characters in languages such as Arabic, Chinese, Korean, Japanese or Cyrillic alphabet languages (e.g. Russian, Serbian, Bulgarian, etc.) must be encoded using the 16-bit UTF-16 character encoding otherwise known as Unicode.\n[block:callout]\n{\n  \"type\": \"info\",\n  \"body\": \"https://dynmark.uservoice.com/knowledgebase/articles/83496-gsm-alphabet-character-set-\",\n  \"title\": \"For more info on the GSM character set, see\"\n}\n[/block]\n\n[block:callout]\n{\n  \"type\": \"info\",\n  \"title\": \"For more info on different character sets\",\n  \"body\": \"https://dynmark.uservoice.com/knowledgebase/articles/79864-character-sets-for-different-languages\"\n}\n[/block]\n**Note**: If your message contains **any** characters that are not included in the GSM or Extended GSM character sets then the message will be encoded entirely as Unicode (UTF-16) and therefore each character will take 16 bits instead of 7 bits more than halving the amount of characters per SMS and potentially doubling your costs!","excerpt":"","slug":"message-encoding","type":"basic","title":"Message Encoding"}
Most text messages will use the more space efficient 7 bit GSM character set as it covers most Latin based language characters. The GSM character set was extended to cover a few more commonly used characters and these 10 characters are known as the Extended GSM characters and include: **€ <FF> [ \ ] ^ { | } ~** Characters included in the Extended GSM character set will use 2 characters (7 bit GSM) instead of the usual 1 as they require an escape character prefix. The GSM 7-bit character set is supported default by all GSM handsets and network elements, but characters in languages such as Arabic, Chinese, Korean, Japanese or Cyrillic alphabet languages (e.g. Russian, Serbian, Bulgarian, etc.) must be encoded using the 16-bit UTF-16 character encoding otherwise known as Unicode. [block:callout] { "type": "info", "body": "https://dynmark.uservoice.com/knowledgebase/articles/83496-gsm-alphabet-character-set-", "title": "For more info on the GSM character set, see" } [/block] [block:callout] { "type": "info", "title": "For more info on different character sets", "body": "https://dynmark.uservoice.com/knowledgebase/articles/79864-character-sets-for-different-languages" } [/block] **Note**: If your message contains **any** characters that are not included in the GSM or Extended GSM character sets then the message will be encoded entirely as Unicode (UTF-16) and therefore each character will take 16 bits instead of 7 bits more than halving the amount of characters per SMS and potentially doubling your costs!