In telecommunication, 4B5B is a form of data communications line code. 4B5B maps groups of 4 bits of data onto groups of 5 bits for transmission. These 5 bit words are pre-determined in a dictionary and they are chosen to ensure that there will be sufficient transitions in the line state to produce a self-clocking signal. A collateral effect of the code is that 25% more bits are needed to send the same information.
Depending on the standard or specification of interest, there may be several 5-bit output codes left unused. The presence of any of the unused codes in the data stream can be used as an indication that there is a fault somewhere in the link. Therefore, the unused codes can be used to detect errors in the data stream.
The name 4B5B is generally taken to mean the FDDI version. Other 4-to-5-bit codes have been used for magnetic recording and are known as group coded recording (GCR), but those are (0,2) run-length limited codes, with at most two consecutive zeros. 4B5B allows up to three consecutive zeros (a (0,3) RLL code), providing a greater variety of control codes.
The 4B5B encoding is also used for USB Power Delivery communication, where it is sent over the USB-C CC pin (further encoded using biphase mark code) or the USB-A/B power lines (further encoded using frequency-shift keying).
4B5B codes are designed to produce at least two transitions per 5 bits of output code regardless of input data. When NRZI-encoded, the transitions provide necessary clock transitions for the receiver. For example, a run of 4 bits such as 00002 contains no transitions and that causes clocking problems for the receiver. 4B5B solves this problem by assigning the 4-bit block a 5-bit code, in this case 111102.
Three consecutive zero bits only appear in normal data when a code ending with two 0 bits (2, E) is followed by a code beginning with a 0 bit (1, 4, 5, 6), so will always appear separated by multiples of the 5-bit encoded symbol length (and never separated by a single symbol). Violations of this property are used for special synchronization codes.
The following codes are sometimes referred to as command characters. They are commonly used in pairs, although USB-PD uses 4-symbol sequences to begin its packets.
|Control character||5b symbols||Purpose|
|JK||11000 10001||Sync, Start delimiter|
|I||11111||100BASE-X idle marker|
|T||01101||USB-PD end delimiter|
|TT||01101 01101||FDDI end delimiter|
|TS||01101 11001||Not used|
|TR||01101 00111||100BASE-X end delimiter|
|SR||11001 00111||Not used|
|SS||11001 11001||Not used|
|H||00100||100BASE-X transmit error|
|JJJK||11000 11000 11000 10001||USB-PD Start Of Packet (SOP)|
|JJLL||11000 11000 00110 00110||USB-PD SOP′|
|JLJL||11000 00110 11000 00110||USB-PD SOP″|
|JSSL||11000 11001 11001 00110||USB-PD SOP′_Debug|
|JSLK||11000 11001 00110 10001||USB-PD SOP″_Debug|
|RRRS||00111 00111 00111 10001||USB-PD Hard Reset|
|RJRL||00111 11000 00111 00110||USB-PD Cable Reset|
- AES10-2008 (r2019): AES Recommended Practice for Digital Audio Engineering - Serial Multichannel Audio Digital Interface (MADI), Audio Engineering Society
- "5.3 Symbol Encoding". Universal Serial Bus Power Delivery Specification. Revision 2.0 Version 1.3. USB Implementers Forum. 12 January 2017. p. 105.
4b5b line code Shall be used. This encodes 4-bit data to 5-bit symbols for transmission and decodes 5-bit symbols to 4-bit data for consumption by the receiver.