numpy.chararray

class numpy.chararray

Provides a convenient view on arrays of string and unicode values.

Note

The chararray class exists for backwards compatibility with Numarray; it is not recommended for new development. As of NumPy 1.4, arrays of strings should instead use dtype object_, string_ or unicode_, together with the free functions in the numpy.char module for fast vectorized string operations.
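As a minimal sketch of the recommended alternative, the snippet below applies a few numpy.char free functions to an ordinary string array. It assumes Python 3; the variable names are illustrative only.

```python
import numpy as np

# A plain ndarray with a fixed-width string dtype, not a chararray.
names = np.array(["alpha", "beta", "gamma"])

# Free functions in numpy.char operate element-wise on the whole array.
upper = np.char.upper(names)                   # upper-cased copy
starts_b = np.char.startswith(names, "b")      # boolean mask
suffixed = np.char.add(names, "_x")            # element-wise concatenation
```

Each call returns a new array and leaves names unchanged.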

Compared with a regular NumPy array of type str or unicode, this class adds the following functionality:

values automatically have whitespace removed from the end when indexed
comparison operators automatically remove whitespace from the end when comparing values
vectorized string operations are provided as methods (e.g. endswith) and infix operators (e.g. "+", "*", "%")
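The three behaviours listed above can be seen side by side in a short sketch. It assumes Python 3 and a NumPy release that still provides numpy.char.array; the sample values are made up for illustration.

```python
import numpy as np

padded = ["abc  ", "de "]
plain = np.array(padded)        # regular ndarray of strings
chars = np.char.array(padded)   # chararray holding the same values

# Indexing: the chararray strips trailing whitespace, the ndarray does not.
plain_item = plain[0]
chars_item = chars[0]

# Comparison: trailing whitespace is ignored only by the chararray.
plain_eq = plain == "abc"
chars_eq = chars == "abc"

# Infix operators map to element-wise string operations.
words = np.char.array(["ab", "cd"])
joined = words + "!"
doubled = words * 2
```

Here chars_item is 'abc' while plain_item keeps its two trailing spaces, and only chars_eq reports a match for the first element.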

chararrays should be created using numpy.char.array or numpy.char.asarray, rather than this constructor directly.
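A small sketch of the two recommended factory functions follows, assuming Python 3. It only inspects the resulting type and dtype; whether numpy.char.asarray actually avoids a copy depends on the input and is not asserted here.

```python
import numpy as np

# From a Python list: the element width is inferred from the longest string.
from_list = np.char.array(["x", "yy", "zzz"])

# From an existing ndarray of bytes, using asarray.
from_bytes = np.char.asarray(np.array([b"x", b"yy"]))

# An explicit itemsize reserves room for longer values later on.
roomy = np.char.array(["x", "yy"], itemsize=5)

kinds = (from_list.dtype.kind, from_bytes.dtype.kind, roomy.dtype.kind)
```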

This constructor creates the array, using buffer (with offset and strides) if it is not None. If buffer is None, it constructs a new array with strides in “C order”, unless both len(shape) >= 2 and order='F', in which case the strides are in “Fortran order”.
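To illustrate the stride layout described above, this sketch builds two uninitialized arrays with the constructor and compares their strides. It is for demonstration only (the factory functions remain the recommended way to create a chararray) and uses numpy.char.chararray, the same class reached through the numpy.char module.

```python
import numpy as np

# Byte strings of width 4, so every element occupies 4 bytes.
c_arr = np.char.chararray((2, 3), itemsize=4)             # default C order
f_arr = np.char.chararray((2, 3), itemsize=4, order="F")  # Fortran order

# C order: the last axis is contiguous (3 elements * 4 bytes per row).
c_strides = c_arr.strides   # (12, 4)
# Fortran order: the first axis is contiguous (2 elements * 4 bytes per column).
f_strides = f_arr.strides   # (4, 8)
```

The element contents are whatever happened to be in memory, so only the layout is inspected.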

Parameters:


shape : tuple

Shape of the array.

itemsize : int, optional

Length of each array element, in number of characters. Default is 1.

unicode : bool, optional

Whether the array elements are of type unicode (True) or string (False). Default is False.

buffer : object exposing the buffer interface or str, optional

Object whose memory holds the array data. Default is None, in which case a new array is created.

offset : int, optional

Offset of the array data within buffer, in bytes. Default is 0. Must be >= 0.

strides : array_like of ints, optional

Strides for the array (see ndarray.strides for full description). Default is None.

order : {‘C’, ‘F’}, optional

The order in which the array data is stored in memory: ‘C’ -> “row major” order (the default), ‘F’ -> “column major” (Fortran) order.
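The sketch below exercises the itemsize, unicode, buffer and offset parameters. It assumes Python 3 and that buffer is forwarded to the underlying ndarray constructor, so a buffer-protocol object such as a bytearray is passed; the names raw, window and wide are illustrative.

```python
import numpy as np

# Wrap existing memory: skip 2 bytes, then read two 3-byte strings.
raw = bytearray(b"abcdefgh")
window = np.char.chararray((2,), itemsize=3, buffer=raw, offset=2)
window_items = window.tolist()      # [b'cde', b'fgh']

# The array shares memory with the buffer, so writes show up in raw.
window[0] = b"XYZ"
raw_after = bytes(raw)              # b'abXYZfgh'

# unicode=True switches to a unicode dtype; itemsize counts characters,
# while the ndarray attribute of the same name reports bytes.
wide = np.char.chararray((2,), itemsize=3, unicode=True)
wide_dtype = wide.dtype             # 3-character unicode dtype
wide_bytes = wide.itemsize          # 12 (4 bytes per character)
```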

Examples

>>> charar = np.chararray((3, 3))
>>> charar[:] = 'a'
>>> charar
chararray([['a', 'a', 'a'],
       ['a', 'a', 'a'],
       ['a', 'a', 'a']],
      dtype='|S1')

>>> charar = np.chararray(charar.shape, itemsize=5)
>>> charar[:] = 'abc'
>>> charar
chararray([['abc', 'abc', 'abc'],
       ['abc', 'abc', 'abc'],
       ['abc', 'abc', 'abc']],
      dtype='|S5')

Attributes

`T`	Same as self.transpose(), except that self is returned if self.ndim < 2.
`base`	Base object if memory is from some other object.
`ctypes`	An object to simplify the interaction of the array with the ctypes module.
`data`	Python buffer object pointing to the start of the array’s data.
`dtype`	Data-type of the array’s elements.
`flags`	Information about the memory layout of the array.
`flat`	A 1-D iterator over the array.
`imag`	The imaginary part of the array.
`itemsize`	Length of one array element in bytes.
`nbytes`	Total bytes consumed by the elements of the array.
`ndim`	Number of array dimensions.
`real`	The real part of the array.
`shape`	Tuple of array dimensions.
`size`	Number of elements in the array.
`strides`	Tuple of bytes to step in each dimension when traversing an array.
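A brief sketch reading several of these attributes from a small two-dimensional chararray, assuming Python 3, where NumPy stores each unicode character in 4 bytes:

```python
import numpy as np

arr = np.char.array([["ab", "cd", "ef"],
                     ["gh", "ij", "kl"]])

shape = arr.shape        # (2, 3)
ndim = arr.ndim          # 2
size = arr.size          # 6
itemsize = arr.itemsize  # 8 bytes: 2 characters * 4 bytes
nbytes = arr.nbytes      # 48 = size * itemsize
strides = arr.strides    # (24, 8) for a C-ordered array
t_shape = arr.T.shape    # (3, 2)
```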

Methods

`astype`(dtype[, order, casting, subok, copy])	Copy of the array, cast to a specified type.
`copy`([order])	Return a copy of the array.
`count`(sub[, start, end])	Returns an array with the number of non-overlapping occurrences of substring `sub` in the range [`start`, `end`].
`decode`([encoding, errors])	Calls `str.decode` element-wise.
`dump`(file)	Dump a pickle of the array to the specified file.
`dumps`()	Returns the pickle of the array as a string.
`encode`([encoding, errors])	Calls `str.encode` element-wise.
`endswith`(suffix[, start, end])	Returns a boolean array which is `True` where the string element in `self` ends with `suffix`, otherwise `False`.
`expandtabs`([tabsize])	Return a copy of each string element where all tab characters are replaced by one or more spaces.
`fill`(value)	Fill the array with a scalar value.
`find`(sub[, start, end])	For each element, return the lowest index in the string where substring `sub` is found.
`flatten`([order])	Return a copy of the array collapsed into one dimension.
`getfield`(dtype[, offset])	Returns a field of the given array as a certain type.
`index`(sub[, start, end])	Like `find`, but raises `ValueError` when the substring is not found.
`isalnum`()	Returns true for each element if all characters in the string are alphanumeric and there is at least one character, false otherwise.
`isalpha`()	Returns true for each element if all characters in the string are alphabetic and there is at least one character, false otherwise.
`isdecimal`()	For each element in `self`, return True if there are only decimal characters in the element.
`isdigit`()	Returns true for each element if all characters in the string are digits and there is at least one character, false otherwise.
`islower`()	Returns true for each element if all cased characters in the string are lowercase and there is at least one cased character, false otherwise.
`isnumeric`()	For each element in `self`, return True if there are only numeric characters in the element.
`isspace`()	Returns true for each element if there are only whitespace characters in the string and there is at least one character, false otherwise.
`istitle`()	Returns true for each element if the element is a titlecased string and there is at least one character, false otherwise.
`isupper`()	Returns true for each element if all cased characters in the string are uppercase and there is at least one character, false otherwise.
`item`(*args)	Copy an element of an array to a standard Python scalar and return it.
`join`(seq)	Return a string which is the concatenation of the strings in the sequence `seq`.
`ljust`(width[, fillchar])	Return an array with the elements of `self` left-justified in a string of length `width`.
`lower`()	Return an array with the elements of `self` converted to lowercase.
`lstrip`([chars])	For each element in `self`, return a copy with the leading characters removed.
`nonzero`()	Return the indices of the elements that are non-zero.
`put`(indices, values[, mode])	Set `a.flat[n] = values[n]` for all `n` in indices.
`ravel`([order])	Return a flattened array.
`repeat`(repeats[, axis])	Repeat elements of an array.
`replace`(old, new[, count])	For each element in `self`, return a copy of the string with all occurrences of substring `old` replaced by `new`.
`reshape`(shape[, order])	Returns an array containing the same data with a new shape.
`resize`(new_shape[, refcheck])	Change shape and size of array in-place.
`rfind`(sub[, start, end])	For each element in `self`, return the highest index in the string where substring `sub` is found, such that `sub` is contained within [`start`, `end`].
`rindex`(sub[, start, end])	Like `rfind`, but raises `ValueError` when the substring `sub` is not found.
`rjust`(width[, fillchar])	Return an array with the elements of `self` right-justified in a string of length `width`.
`rsplit`([sep, maxsplit])	For each element in `self`, return a list of the words in the string, using `sep` as the delimiter string.
`rstrip`([chars])	For each element in `self`, return a copy with the trailing characters removed.
`searchsorted`(v[, side, sorter])	Find indices where elements of v should be inserted in a to maintain order.
`setfield`(val, dtype[, offset])	Put a value into a specified place in a field defined by a data-type.
`setflags`([write, align, uic])	Set array flags WRITEABLE, ALIGNED, and UPDATEIFCOPY, respectively.
`sort`([axis, kind, order])	Sort an array, in-place.
`split`([sep, maxsplit])	For each element in `self`, return a list of the words in the string, using `sep` as the delimiter string.
`splitlines`([keepends])	For each element in `self`, return a list of the lines in the element, breaking at line boundaries.
`squeeze`([axis])	Remove single-dimensional entries from the shape of `a`.
`startswith`(prefix[, start, end])	Returns a boolean array which is `True` where the string element in `self` starts with `prefix`, otherwise `False`.
`strip`([chars])	For each element in `self`, return a copy with the leading and trailing characters removed.
`swapaxes`(axis1, axis2)	Return a view of the array with `axis1` and `axis2` interchanged.
`swapcase`()	For each element in `self`, return a copy of the string with uppercase characters converted to lowercase and vice versa.
`take`(indices[, axis, out, mode])	Return an array formed from the elements of `a` at the given indices.
`title`()	For each element in `self`, return a titlecased version of the string: words start with uppercase characters, all remaining cased characters are lowercase.
`tofile`(fid[, sep, format])	Write array to a file as text or binary (default).
`tolist`()	Return the array as a (possibly nested) list.
`tostring`([order])	Construct Python bytes containing the raw data bytes in the array.
`translate`(table[, deletechars])	For each element in `self`, return a copy of the string where all characters occurring in the optional argument `deletechars` are removed, and the remaining characters have been mapped through the given translation table.
`transpose`(*axes)	Returns a view of the array with axes transposed.
`upper`()	Return an array with the elements of `self` converted to uppercase.
`view`([dtype, type])	New view of array with the same data.
`zfill`(width)	Return the numeric string left-filled with zeros in a string of length `width`.

`argsort`([axis, kind, order])	Return the indices that sort the array lexicographically.
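As a short sketch of how the string methods in the table behave element-wise, the following chains a few of them on a made-up array of file names (Python 3 assumed):

```python
import numpy as np

files = np.char.array(["report.TXT", "data.csv", "notes.txt"])

lower = files.lower()                     # lower-cased chararray
is_txt = lower.endswith(".txt")           # boolean array
dot_at = files.find(".")                  # index of the first "." per element
renamed = lower.replace(".txt", ".md")    # element-wise substring replacement
parts = files.split(".")                  # object array holding one list per element
```

Methods that return strings give back a new array, while predicates such as endswith return a boolean array of the same shape.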

© 2008–2017 NumPy Developers
Licensed under the NumPy License.
https://docs.scipy.org/doc/numpy-1.13.0/reference/generated/numpy.chararray.html