分享

C data types

 迎风初开 2014-10-21

C data types

From Wikipedia, the free encyclopedia

In the C programming languagedata types refers to an extensive system for declaring variables of different types. The language itself provides basic arithmetic types and syntax to build array and compound types. Several headers in the standard library contain definitions of support types, that have additional properties, such as exact size, guaranteed.[1][2]

Basic types[edit]

The C language provides many basic types. Most of them are formed from one of the four basic arithmetic type specifiers in C (charintfloat anddouble), and optional specifiers (signedunsignedshortlong). All available basic arithmetic types are listed below:

TypeExplanation
charsmallest addressable unit of the machine that can contain basic character set. It is an integer type. Actual type can be either signed or unsigned depending on the implementation.
signed charsame size as char, but guaranteed to be signed.
unsigned charsame size as char, but guaranteed to be unsigned.
short
short int
signed short
signed short int
short signed integer type. At least in the [?32767,+32767] range,[3] thus at least 16 bits in size.
unsigned short
unsigned short int
same as short, but unsigned.
int
signed int
basic signed integer type. At least in the [?32767,+32767] range,[3] thus at least 16 bits in size.
unsigned
unsigned int
same as int, but unsigned.
long
long int
signed long
signed long int
long signed integer type. At least in the [?2147483647,+2147483647] range,[3] thus at least 32 bits in size.
unsigned long
unsigned long int
same as long, but unsigned.
long long
long long int
signed long long
signed long long int
long long signed integer type. At least in the [?9223372036854775807,+9223372036854775807] range,[3] thus at least 64 bits in size. Specified since the C99 version of the standard.
unsigned long long
unsigned long long int
same as long long, but unsigned. Specified since the C99 version of the standard.
floatsingle precision floating-point type. Actual properties unspecified (except minimum limits), however on most systems this is the IEEE 754 single-precision binary floating-point format. This format is required by the optional Annex F "IEC 60559 floating-point arithmetic".
doubledouble precision floating-point type. Actual properties unspecified (except minimum limits), however on most systems this is theIEEE 754 double-precision binary floating-point format. This format is required by the optional Annex F "IEC 60559 floating-point arithmetic".
long doubleextended precision floating-point type. Actual properties unspecified. Unlike types float and double, it can be either 80-bit floating point format, the non-IEEE "double-double" or IEEE 754 quadruple-precision floating-point format if a higher precision format is provided, otherwise it is the same as double. See the article on long double for details.

The actual size of integer types varies by implementation. The standard only requires size relations between the data types and minimum sizes for each data type:

The relation requirements are that the long long is not smaller than long, which is not smaller than int, which is not smaller than short. As char's size is always the minimum supported data type, all other data types can't be smaller.

The minimum size for char is 8 bits, the minimum size for short and int is 16 bits, for long it is 32 bits and long long must contain at least 64 bits.

The type int should be the integer type that the target processor is most efficient working with. This allows great flexibility: for example, all types can be 64-bit. However, several different integer width schemes (data models) are popular. This is because the data model defines how different programs communicate, a uniform data model is used within a given operating system application interface.[4]

In practice it should be noted that char is usually 8 bits in size and short is usually 16 bits in size (as are their unsigned counterparts). This holds true for platforms as diverse as 1990s SunOS 4 Unix, Microsoft MS-DOS, modern Linux, and Microchip MCC18 for embedded 8 bit PIC microcontrollers. POSIX requireschar to be exactly 8 bits in size.

The actual size and behavior of floating-point types also vary by implementation. The only guarantee is that long double is not smaller than double, which is not smaller than float. Usually, the 32-bit and 64-bit IEEE 754 binary floating-point formats are used, if supported by hardware.

Boolean type[edit]

C99 added a boolean (true/false) type (_Bool) which is defined in the <stdbool.h> header. Additionally, the standard requires that macros are defined to alias the type as bool as well as providing macros for true and false.

Size and pointer difference types[edit]

The C language provides the separate types size_t and ptrdiff_t to represent memory-related quantities. Existing types were deemed insufficient, because their size is defined according to the target processor's arithmetic capabilities, not the memory capabilities, such as available address space. Both of these types are defined in the <stddef.h> header (cstddef header in C++).

size_t is used to represent the size of any object (including arrays) in the particular implementation. It is used as the return type of the sizeof operator. The maximum size of size_t is provided via SIZE_MAX, a macro constant which is defined in the <stdint.h> header (cstdint header in C++). As an unsigned type, size_t is guaranteed to be wide enough to accommodate at least the value of 65535. Signed sizes can be represented by ssize_t, which is a POSIX extension.

ptrdiff_t is used to represent the difference between pointers.

Interface to the properties of the basic types[edit]

Information about the actual properties, such as size, of the basic arithmetic types, is provided via macro constants in two headers: <limits.h> header (climits header in C++) defines macros for integer types and <float.h> header (cfloat header in C++) defines macros for floating-point types. The actual values depend on the implementation.

Properties of integer types
  • CHAR_BIT – size of the char type in bits (at least 8 bits)
  • SCHAR_MINSHRT_MININT_MINLONG_MINLLONG_MIN(C99) – minimum possible value of signed integer types: signed charsigned shortsignedintsigned longsigned long long
  • SCHAR_MAXSHRT_MAXINT_MAXLONG_MAXLLONG_MAX(C99) – maximum possible value of signed integer types: signed charsigned short,signed intsigned longsigned long long
  • UCHAR_MAXUSHRT_MAXUINT_MAXULONG_MAXULLONG_MAX(C99) – maximum possible value of unsigned integer types: unsigned charunsignedshortunsigned intunsigned longunsigned long long
  • CHAR_MIN – minimum possible value of char
  • CHAR_MAX – maximum possible value of char
  • MB_LEN_MAX – maximum number of bytes in a multibyte character
Properties of floating-point types
  • FLT_MINDBL_MINLDBL_MIN – minimum normalized positive value of floatdoublelong double respectively
  • FLT_TRUE_MINDBL_TRUE_MINLDBL_TRUE_MIN (C11) – minimum positive value of floatdoublelong double respectively
  • FLT_MAXDBL_MAXLDBL_MAX – maximum finite value of floatdoublelong double respectively
  • FLT_ROUNDS – rounding mode for floating-point operations
  • FLT_EVAL_METHOD (C99) – evaluation method of expressions involving different floating-point types
  • FLT_RADIX – radix of the exponent in the floating-point types
  • FLT_DIGDBL_DIGLDBL_DIG – number of decimal digits that can be represented without losing precision by floatdoublelong double respectively
  • FLT_EPSILONDBL_EPSILONLDBL_EPSILON – difference between 1.0 and the next representable value of floatdoublelong double respectively
  • FLT_MANT_DIGDBL_MANT_DIGLDBL_MANT_DIG – number of FLT_RADIX-base digits in the floating-point significand for types floatdoublelongdouble respectively
  • FLT_MIN_EXPDBL_MIN_EXPLDBL_MIN_EXP – minimum negative integer such that FLT_RADIX raised to a power one less than that number is a normalized floatdoublelong double respectively
  • FLT_MIN_10_EXPDBL_MIN_10_EXPLDBL_MIN_10_EXP – minimum negative integer such that 10 raised to a power one less than that number is a normalized floatdoublelong double respectively
  • FLT_MAX_EXPDBL_MAX_EXPLDBL_MAX_EXP – maximum positive integer such that FLT_RADIX raised to a power one more than that number is a normalized floatdoublelong double respectively
  • FLT_MAX_10_EXPDBL_MAX_10_EXPLDBL_MAX_10_EXP – maximum positive integer such that 10 raised to a power one more than that number is a normalized floatdoublelong double respectively
  • DECIMAL_DIG (C99) – minimum number of decimal digits such that any number of the widest supported floating-point type can be represented in decimal with a precision of DECIMAL_DIG digits and read back in the original floating-point type without changing its value. DECIMAL_DIG is at least 10.

Fixed-width integer types[edit]

The C99 standard includes definitions of several new integer types to enhance the portability of programs.[2] The already available basic integer types were deemed insufficient, because their actual sizes are implementation defined and may vary across different systems. The new types are especially useful in embedded environments where hardware usually supports only several types and that support varies between different environments. All new types are defined in<inttypes.h> header (cinttypes header in C++) and also are available at <stdint.h> header (cstdint header in C++). The types can be grouped into the following categories:

  • Exact-width integer types which are guaranteed to have the same number N of bits across all implementations. Included only if it is available in the implementation.
  • Least-width integer types which are guaranteed to be the smallest type available in the implementation, that has at least specified number N of bits. Guaranteed to be specified for at least N=8,16,32,64.
  • Fastest integer types which are guaranteed to be the fastest integer type available in the implementation, that has at least specified number N of bits. Guaranteed to be specified for at least N=8,16,32,64.
  • Pointer integer types which are guaranteed to be able to hold a pointer
  • Maximum-width integer types which are guaranteed to be the largest integer type in the implementation

The following table summarizes the types and the interface to acquire the implementation details (N refers to the number of bits):

Type categorySigned typesUnsigned types
TypeMinimum valueMaximum valueTypeMinimum valueMaximum value
Exact widthintN_tINTN_MININTN_MAXuintN_t0UINTN_MAX
Least widthint_leastN_tINT_LEASTN_MININT_LEASTN_MAXuint_leastN_t0UINT_LEASTN_MAX
Fastestint_fastN_tINT_FASTN_MININT_FASTN_MAXuint_fastN_t0UINT_FASTN_MAX
Pointerintptr_tINTPTR_MININTPTR_MAXuintptr_t0UINTPTR_MAX
Maximum widthintmax_tINTMAX_MININTMAX_MAXuintmax_t0UINTMAX_MAX

Printf and scanf format specifiers[edit]

The <inttypes.h> header (cinttypes header in C++) provides features that enhance the functionality of the types defined in <stdint.h> header. Included are macros that define printf format string and scanf format string specifiers corresponding to the <stdint.h> types and several functions for working withintmax_t and uintmax_t types. This header was added in C99.

Printf format string

The macros are in the format PRI{fmt}{type}. Here {fmt} defines the output formatting and is one of d (decimal), x (hexadecimal), o (octal), u (unsigned) and i (integer). {type} defines the type of the argument and is one of NFASTNLEASTNPTRMAX, where N corresponds to the number of bits in the argument.

Scanf format string

The macros are in the format SCN{fmt}{type}. Here {fmt} defines the output formatting and is one of d (decimal), x (hexadecimal), o (octal), u (unsigned) and i (integer). {type} defines the type of the argument and is one of NFASTNLEASTNPTRMAX, where N corresponds to the number of bits in the argument.

Functions

Additional floating-point types[edit]

The C99 standard includes new floating-point types float_t and double_t, defined in <math.h>. They correspond to the types used for the intermediate results of floating-point expressions when FLT_EVAL_METHOD is 0, 1, or 2. These types may be wider than long double.

Structures[edit]

Structures are a way of storing multiple pieces of data in one variable. For example, say we wanted to store the name and birthday of a person in strings, in one variable. We could use a structure to house that data:

struct birthday
{
    char name[20];
    int day;
    int month;
    int year;
};

Structures may contain pointers to structs of its own type, which is common in linked data structures.

A C implementation has freedom to design the memory layout of the struct, with few restrictions; one being that the memory address of the first member will be the same as the address of struct itself. Structs may be initialized or assigned to using compound literals.

A user-written function can directly return a structure, though it will often not be very efficient at run-time.

Arrays[edit]

For every type T, except void and function types, there exist the types “array of N elements of type T”.

An array is a collection of values, all of the same type, stored contiguously in memory. An array of size N is indexed by integers from 0 up to and including N-1.

There are also "arrays of unspecified size" where the number of elements is not known by the compiler.

For example:

int cat[10];  // array of 10 elements, each of type int
int bob[];    // array of an unspecified number of 'int' elements.

Arrays can be initialized with a compound initializer, but not assigned. Arrays are passed to functions by passing a pointer to the first element.

Multidimensional arrays are defined as "array of array …". All but the outermost dimension must have compile-time constant size:

int a[10][8];  // array of 10 elements, each of type 'array of 8 int elements'
float f[][32]; // array of unspecified number of 'array of 32 float elements'

Pointer types[edit]

For every type T there exists a type “pointer to T”.

Variables can be declared as being pointers to values of various types, by means of the * type declarator. To declare a variable as a pointer, precede its name with an asterisk.

char *square;
long *circle;

Hence "for every type T" also applies to pointer types there exists multi-indirect pointers like char** or int*** and so on. There exists also "pointer to array" types, but they are less common than "array of pointer", and their syntax is quite confusing:

char *pc[10]; // array of 10 elements of 'pointer to char'
char (*pa)[10]; // pointer to a 10-element array of char

pc consumes 10×sizeof(char*) bytes (usually 40 or 80 bytes on common platforms), but pa is only one pointer, so sizeof(pa) is usually 4 or 8, and the data it refers to is an array of 10 bytes: sizeof(*pa) == 10.

Unions[edit]

Union types are special structures which allow access to the same memory using different type descriptions; one could, for example, describe a union of data types which would allow reading the same data as an integer, a float or a user declared type:

union
{
    int i;
    float f;
    struct
    {
        unsigned int u;
        double d;
    } s;
} u;

In the above example the total size of u is the size of u.s (which happens to be the sum of the sizes of u.s.u and u.s.d), since s is larger than both i andf. When assigning something to u.i, some parts of u.f may be preserved if u.i is smaller than u.f.

Reading from a union member is not the same as casting since the value of the member is not converted, but merely read.

Function pointers[edit]

Function pointers allow referencing functions with a particular signature. For example, to store the address of the standard function abs in the variablemy_int_f:

int (*my_int_f)(int) = &abs;
// the & operator can be omitted, but makes clear that the "address of" abs is used here

    本站是提供个人知识管理的网络存储空间,所有内容均由用户发布,不代表本站观点。请注意甄别内容中的联系方式、诱导购买等信息,谨防诈骗。如发现有害或侵权内容,请点击一键举报。
    转藏 分享 献花(0

    0条评论

    发表

    请遵守用户 评论公约

    类似文章 更多