A sequence of n adjacent symbols in a particular order.

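In the context of NLP, an n-gram is a sequence of n consecutive items in a text document, where the items may be words, numbers, symbols, or punctuation marks. For example, the word-level 2-grams (bigrams) of "the cat sat" are "the cat" and "cat sat". Including n-grams typically enlarges the vocabulary derived from a corpus, because each distinct n-gram becomes a vocabulary entry in addition to the individual items.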
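
As a minimal sketch of the definition above, the following Python snippet extracts word-level n-grams with a sliding window and shows how adding bigrams enlarges a vocabulary. The function name `ngrams`, the whitespace tokenization, and the sample sentence are illustrative assumptions, not part of any particular library.

```python
def ngrams(tokens, n):
    """Return every run of n adjacent tokens, in order, as a list of tuples."""
    if n < 1:
        raise ValueError("n must be at least 1")
    # Slide a window of width n across the token list.
    # If the list is shorter than n, the range is empty and so is the result.
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Assumption: naive whitespace tokenization, for illustration only.
tokens = "the cat sat on the mat".split()

unigrams = ngrams(tokens, 1)
bigrams = ngrams(tokens, 2)

# The vocabulary is the set of distinct entries; adding bigrams extends it.
vocab_unigrams = set(unigrams)
vocab_with_bigrams = vocab_unigrams | set(bigrams)

print(bigrams)
print(len(vocab_unigrams), len(vocab_with_bigrams))
```

Here the six tokens yield five distinct unigrams (the word "the" repeats) and five distinct bigrams, so the combined vocabulary has ten entries. A real pipeline would usually apply a proper tokenizer and may operate on characters rather than words.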