Format Settings
These settings are autogenerated from source.
allow_special_bool_values_inside_variant
Allows to parse Bool values inside Variant type from special text bool values like "on", "off", "enable", "disable", etc.
bool_false_representation
Text to represent false bool value in TSV/CSV/Vertical/Pretty formats.
bool_true_representation
Text to represent true bool value in TSV/CSV/Vertical/Pretty formats.
column_names_for_schema_inference
The list of column names to use in schema inference for formats without column names. The format: 'column1,column2,column3,...'
cross_to_inner_join_rewrite
Use inner join instead of comma/cross join if there are joining expressions in the WHERE section. Values: 0 - no rewrite, 1 - apply if possible for comma/cross, 2 - force rewrite all comma joins, cross - if possible
date_time_64_output_format_cut_trailing_zeros_align_to_groups_of_thousands
Dynamically trim the trailing zeros of datetime64 values to adjust the output scale to [0, 3, 6], corresponding to 'seconds', 'milliseconds', and 'microseconds'
date_time_input_format
Allows choosing a parser of the text representation of date and time.
The setting does not apply to date and time functions.
Possible values:
- 
'best_effort'— Enables extended parsing.ClickHouse can parse the basic YYYY-MM-DD HH:MM:SSformat and all ISO 8601 date and time formats. For example,'2018-06-08T01:02:03.000Z'.
- 
'best_effort_us'— Similar tobest_effort(see the difference in parseDateTimeBestEffortUS
- 
'basic'— Use basic parser.ClickHouse can parse only the basic YYYY-MM-DD HH:MM:SSorYYYY-MM-DDformat. For example,2019-08-20 10:18:56or2019-08-20.
Cloud default value: 'best_effort'.
See also:
date_time_output_format
Allows choosing different output formats of the text representation of date and time.
Possible values:
- 
simple- Simple output format.ClickHouse output date and time YYYY-MM-DD hh:mm:ssformat. For example,2019-08-20 10:18:56. The calculation is performed according to the data type's time zone (if present) or server time zone.
- 
iso- ISO output format.ClickHouse output date and time in ISO 8601 YYYY-MM-DDThh:mm:ssZformat. For example,2019-08-20T10:18:56Z. Note that output is in UTC (Zmeans UTC).
- 
unix_timestamp- Unix timestamp output format.ClickHouse output date and time in Unix timestamp format. For example 1566285536.
See also:
date_time_overflow_behavior
Defines the behavior when Date, Date32, DateTime, DateTime64 or integers are converted into Date, Date32, DateTime or DateTime64 but the value cannot be represented in the result type.
Possible values:
- ignore— Silently ignore overflows. Result are undefined.
- throw— Throw an exception in case of overflow.
- saturate— Saturate the result. If the value is smaller than the smallest value that can be represented by the target type, the result is chosen as the smallest representable value. If the value is bigger than the largest value that can be represented by the target type, the result is chosen as the largest representable value.
Default value: ignore.
dictionary_use_async_executor
Execute a pipeline for reading dictionary source in several threads. It's supported only by dictionaries with local CLICKHOUSE source.
errors_output_format
Method to write Errors to text output.
exact_rows_before_limit
When enabled, ClickHouse will provide exact value for rows_before_limit_at_least statistic, but with the cost that the data before limit will have to be read completely
format_avro_schema_registry_url
For AvroConfluent format: Confluent Schema Registry URL.
format_binary_max_array_size
The maximum allowed size for Array in RowBinary format. It prevents allocating large amount of memory in case of corrupted data. 0 means there is no limit
format_binary_max_string_size
The maximum allowed size for String in RowBinary format. It prevents allocating large amount of memory in case of corrupted data. 0 means there is no limit
format_capn_proto_enum_comparising_mode
How to map ClickHouse Enum and CapnProto Enum
format_capn_proto_use_autogenerated_schema
Use autogenerated CapnProto schema when format_schema is not set
format_csv_allow_double_quotes
If it is set to true, allow strings in double quotes.
format_csv_allow_single_quotes
If it is set to true, allow strings in single quotes.
format_csv_delimiter
The character to be considered as a delimiter in CSV data. If setting with a string, a string has to have a length of 1.
format_csv_null_representation
Custom NULL representation in CSV format
format_custom_escaping_rule
Field escaping rule (for CustomSeparated format)
format_custom_field_delimiter
Delimiter between fields (for CustomSeparated format)
format_custom_result_after_delimiter
Suffix after result set (for CustomSeparated format)
format_custom_result_before_delimiter
Prefix before result set (for CustomSeparated format)
format_custom_row_after_delimiter
Delimiter after field of the last column (for CustomSeparated format)
format_custom_row_before_delimiter
Delimiter before field of the first column (for CustomSeparated format)
format_custom_row_between_delimiter
Delimiter between rows (for CustomSeparated format)
format_display_secrets_in_show_and_select
Enables or disables showing secrets in SHOW and SELECT queries for tables, databases,
table functions, and dictionaries.
User wishing to see secrets must also have
display_secrets_in_show_and_select server setting
turned on and a
displaySecretsInShowAndSelect privilege.
Possible values:
- 0 — Disabled.
- 1 — Enabled.
format_json_object_each_row_column_for_object_name
The name of column that will be used for storing/writing object names in JSONObjectEachRow format.
Column type should be String. If value is empty, default names row_{i}will be used for object names.
format_protobuf_use_autogenerated_schema
Use autogenerated Protobuf when format_schema is not set
format_regexp
Regular expression (for Regexp format)
format_regexp_escaping_rule
Field escaping rule (for Regexp format)
format_regexp_skip_unmatched
Skip lines unmatched by regular expression (for Regexp format)
format_schema
This parameter is useful when you are using formats that require a schema definition, such as Cap'n Proto or Protobuf. The value depends on the format.
format_schema_message_name
Define the name of the required message in the schema defined in format_schema.
To maintain compatibility with the legacy format_schema format (file_name:message_name):
- If format_schema_message_nameis not specified, the message name is inferred from themessage_namepart of the legacyformat_schemavalue.
- If format_schema_message_nameis specified while using the legacy format, an error will be raised.
format_schema_source
Define the source of format_schema.
Possible values:
- 'file' (default): The format_schemais the name of a schema file located in theformat_schemasdirectory.
- 'string': The format_schemais the literal content of the schema.
- 'query': The format_schemais a query to retrieve the schema. Whenformat_schema_sourceis set to 'query', the following conditions apply:
- The query must return exactly one value: a single row with a single string column.
- The result of the query is treated as the schema content.
- This result is cached locally in the format_schemasdirectory.
- You can clear the local cache using the command: SYSTEM DROP FORMAT SCHEMA CACHE FOR Files.
- Once cached, identical queries are not executed to fetch the schema again until the cache is explicitly cleared
- In addition to local cache files, Protobuf messages are also cached in memory. Even after clearing the local cache files, the in-memory cache must be cleared using SYSTEM DROP FORMAT SCHEMA CACHE [FOR Protobuf]to fully refresh the schema.
- Run the query SYSTEM DROP FORMAT SCHEMA CACHEto clear the cache for both cache files and Protobuf messages schemas at once.
format_template_resultset
Path to file which contains format string for result set (for Template format)
format_template_resultset_format
Format string for result set (for Template format)
format_template_row
Path to file which contains format string for rows (for Template format)
format_template_row_format
Format string for rows (for Template format)
format_template_rows_between_delimiter
Delimiter between rows (for Template format)
format_tsv_null_representation
Custom NULL representation in TSV format
input_format_allow_errors_num
Sets the maximum number of acceptable errors when reading from text formats (CSV, TSV, etc.).
The default value is 0.
Always pair it with input_format_allow_errors_ratio.
If an error occurred while reading rows but the error counter is still less than input_format_allow_errors_num, ClickHouse ignores the row and moves on to the next one.
If both input_format_allow_errors_num and input_format_allow_errors_ratio are exceeded, ClickHouse throws an exception.
input_format_allow_errors_ratio
Sets the maximum percentage of errors allowed when reading from text formats (CSV, TSV, etc.). The percentage of errors is set as a floating-point number between 0 and 1.
The default value is 0.
Always pair it with input_format_allow_errors_num.
If an error occurred while reading rows but the error counter is still less than input_format_allow_errors_ratio, ClickHouse ignores the row and moves on to the next one.
If both input_format_allow_errors_num and input_format_allow_errors_ratio are exceeded, ClickHouse throws an exception.
input_format_allow_seeks
Allow seeks while reading in ORC/Parquet/Arrow input formats.
Enabled by default.
input_format_arrow_allow_missing_columns
Allow missing columns while reading Arrow input formats
input_format_arrow_case_insensitive_column_matching
Ignore case when matching Arrow columns with CH columns.
input_format_arrow_skip_columns_with_unsupported_types_in_schema_inference
Skip columns with unsupported types while schema inference for format Arrow
input_format_avro_allow_missing_fields
For Avro/AvroConfluent format: when field is not found in schema use default value instead of error
input_format_avro_null_as_default
For Avro/AvroConfluent format: insert default in case of null and non Nullable colum
input_format_binary_decode_types_in_binary_format
Read data types in binary format instead of type names in RowBinaryWithNamesAndTypes input format
input_format_binary_read_json_as_string
Read values of JSON data type as JSON String values in RowBinary input format.
input_format_bson_skip_fields_with_unsupported_types_in_schema_inference
Skip fields with unsupported types while schema inference for format BSON.
input_format_capn_proto_skip_fields_with_unsupported_types_in_schema_inference
Skip columns with unsupported types while schema inference for format CapnProto
input_format_csv_allow_cr_end_of_line
If it is set true, \r will be allowed at end of line not followed by
input_format_csv_allow_variable_number_of_columns
Ignore extra columns in CSV input (if file has more columns than expected) and treat missing fields in CSV input as default values
input_format_csv_allow_whitespace_or_tab_as_delimiter
Allow to use spaces and tabs(\t) as field delimiter in the CSV strings
input_format_csv_arrays_as_nested_csv
When reading Array from CSV, expect that its elements were serialized in nested CSV and then put into string. Example: "[""Hello"", ""world"", ""42"""" TV""]". Braces around array can be omitted.
input_format_csv_deserialize_separate_columns_into_tuple
If it set to true, then separate columns written in CSV format can be deserialized to Tuple column.
input_format_csv_detect_header
Automatically detect header with names and types in CSV format
input_format_csv_empty_as_default
Treat empty fields in CSV input as default values.
input_format_csv_enum_as_number
Treat inserted enum values in CSV formats as enum indices
input_format_csv_skip_first_lines
Skip specified number of lines at the beginning of data in CSV format
input_format_csv_skip_trailing_empty_lines
Skip trailing empty lines in CSV format
input_format_csv_trim_whitespaces
Trims spaces and tabs (\t) characters at the beginning and end in CSV strings
input_format_csv_try_infer_numbers_from_strings
If enabled, during schema inference ClickHouse will try to infer numbers from string fields. It can be useful if CSV data contains quoted UInt64 numbers.
Disabled by default.
input_format_csv_try_infer_strings_from_quoted_tuples
Interpret quoted tuples in the input data as a value of type String.
input_format_csv_use_best_effort_in_schema_inference
Use some tweaks and heuristics to infer schema in CSV format
input_format_csv_use_default_on_bad_values
Allow to set default value to column when CSV field deserialization failed on bad value
input_format_custom_allow_variable_number_of_columns
Ignore extra columns in CustomSeparated input (if file has more columns than expected) and treat missing fields in CustomSeparated input as default values
input_format_custom_detect_header
Automatically detect header with names and types in CustomSeparated format
input_format_custom_skip_trailing_empty_lines
Skip trailing empty lines in CustomSeparated format
input_format_defaults_for_omitted_fields
When performing INSERT queries, replace omitted input column values with default values of the respective columns. This option applies to JSONEachRow (and other JSON formats), CSV, TabSeparated, TSKV, Parquet, Arrow, Avro, ORC, Native formats and formats with WithNames/WithNamesAndTypes suffixes.
When this option is enabled, extended table metadata are sent from server to client. It consumes additional computing resources on the server and can reduce performance.
Possible values:
- 0 — Disabled.
- 1 — Enabled.
input_format_force_null_for_omitted_fields
Force initialize omitted fields with null values
input_format_hive_text_allow_variable_number_of_columns
Ignore extra columns in Hive Text input (if file has more columns than expected) and treat missing fields in Hive Text input as default values
input_format_hive_text_collection_items_delimiter
Delimiter between collection(array or map) items in Hive Text File
input_format_hive_text_fields_delimiter
Delimiter between fields in Hive Text File
input_format_hive_text_map_keys_delimiter
Delimiter between a pair of map key/values in Hive Text File
input_format_import_nested_json
Enables or disables the insertion of JSON data with nested objects.
Supported formats:
Possible values:
- 0 — Disabled.
- 1 — Enabled.
See also:
- Usage of Nested Structures with the JSONEachRowformat.
input_format_ipv4_default_on_conversion_error
Deserialization of IPv4 will use default values instead of throwing exception on conversion error.
Disabled by default.
input_format_ipv6_default_on_conversion_error
Deserialization of IPV6 will use default values instead of throwing exception on conversion error.
Disabled by default.
input_format_json_compact_allow_variable_number_of_columns
Allow variable number of columns in rows in JSONCompact/JSONCompactEachRow input formats. Ignore extra columns in rows with more columns than expected and treat missing columns as default values.
Disabled by default.
input_format_json_defaults_for_missing_elements_in_named_tuple
Insert default values for missing elements in JSON object while parsing named tuple.
This setting works only when setting input_format_json_named_tuples_as_objects is enabled.
Enabled by default.
input_format_json_empty_as_default
When enabled, replace empty input fields in JSON with default values. For complex default expressions input_format_defaults_for_omitted_fields must be enabled too.
Possible values:
- 0 — Disable.
- 1 — Enable.
input_format_json_ignore_unknown_keys_in_named_tuple
Ignore unknown keys in json object for named tuples.
Enabled by default.
input_format_json_ignore_unnecessary_fields
Ignore unnecessary fields and not parse them. Enabling this may not throw exceptions on json strings of invalid format or with duplicated fields
input_format_json_infer_array_of_dynamic_from_array_of_different_types
If enabled, during schema inference ClickHouse will use Array(Dynamic) type for JSON arrays with values of different data types.
Example:
Enabled by default.
input_format_json_infer_incomplete_types_as_strings
Allow to use String type for JSON keys that contain only Null/{}/[] in data sample during schema inference.
In JSON formats any value can be read as String, and we can avoid errors like Cannot determine type for column 'column_name' by first 25000 rows of data, most likely this column contains only Nulls or empty Arrays/Maps during schema inference
by using String type for keys with unknown types.
Example:
Result:
Enabled by default.
input_format_json_map_as_array_of_tuples
Deserialize maps columns as JSON arrays of tuples.
Disabled by default.
input_format_json_max_depth
Maximum depth of a field in JSON. This is not a strict limit, it does not have to be applied precisely.
input_format_json_named_tuples_as_objects
Parse named tuple columns as JSON objects.
Enabled by default.
input_format_json_read_arrays_as_strings
Allow parsing JSON arrays as strings in JSON input formats.
Example:
Result:
Enabled by default.
input_format_json_read_bools_as_numbers
Allow parsing bools as numbers in JSON input formats.
Enabled by default.
input_format_json_read_bools_as_strings
Allow parsing bools as strings in JSON input formats.
Enabled by default.
input_format_json_read_numbers_as_strings
Allow parsing numbers as strings in JSON input formats.
Enabled by default.
input_format_json_read_objects_as_strings
Allow parsing JSON objects as strings in JSON input formats.
Example:
Result:
Enabled by default.
input_format_json_throw_on_bad_escape_sequence
Throw an exception if JSON string contains bad escape sequence in JSON input formats. If disabled, bad escape sequences will remain as is in the data.
Enabled by default.
input_format_json_try_infer_named_tuples_from_objects
If enabled, during schema inference ClickHouse will try to infer named Tuple from JSON objects. The resulting named Tuple will contain all elements from all corresponding JSON objects from sample data.
Example:
Result:
Enabled by default.
input_format_json_try_infer_numbers_from_strings
If enabled, during schema inference ClickHouse will try to infer numbers from string fields. It can be useful if JSON data contains quoted UInt64 numbers.
Disabled by default.
input_format_json_use_string_type_for_ambiguous_paths_in_named_tuples_inference_from_objects
Use String type instead of an exception in case of ambiguous paths in JSON objects during named tuples inference
input_format_json_validate_types_from_metadata
For JSON/JSONCompact/JSONColumnsWithMetadata input formats, if this setting is set to 1, the types from metadata in input data will be compared with the types of the corresponding columns from the table.
Enabled by default.
input_format_max_block_size_bytes
Limits the size of the blocks formed during data parsing in input formats in bytes. Used in row based input formats when block is formed on ClickHouse side. 0 means no limit in bytes.
input_format_max_bytes_to_read_for_schema_inference
The maximum amount of data in bytes to read for automatic schema inference.
input_format_max_rows_to_read_for_schema_inference
The maximum rows of data to read for automatic schema inference.
input_format_msgpack_number_of_columns
The number of columns in inserted MsgPack data. Used for automatic schema inference from data.
input_format_mysql_dump_map_column_names
Match columns from table in MySQL dump and columns from ClickHouse table by names
input_format_mysql_dump_table_name
Name of the table in MySQL dump from which to read data
input_format_native_allow_types_conversion
Allow data types conversion in Native input format
input_format_native_decode_types_in_binary_format
Read data types in binary format instead of type names in Native input format
input_format_null_as_default
Enables or disables the initialization of NULL fields with default values, if data type of these fields is not nullable.
If column type is not nullable and this setting is disabled, then inserting NULL causes an exception. If column type is nullable, then NULL values are inserted as is, regardless of this setting.
This setting is applicable for most input formats.
For complex default expressions input_format_defaults_for_omitted_fields must be enabled too.
Possible values:
- 0 — Inserting NULLinto a not nullable column causes an exception.
- 1 — NULLfields are initialized with default column values.
input_format_orc_allow_missing_columns
Allow missing columns while reading ORC input formats
input_format_orc_case_insensitive_column_matching
Ignore case when matching ORC columns with CH columns.
input_format_orc_dictionary_as_low_cardinality
Treat ORC dictionary encoded columns as LowCardinality columns while reading ORC files.
input_format_orc_filter_push_down
When reading ORC files, skip whole stripes or row groups based on the WHERE/PREWHERE expressions, min/max statistics or bloom filter in the ORC metadata.
input_format_orc_reader_time_zone_name
The time zone name for ORC row reader, the default ORC row reader's time zone is GMT.
input_format_orc_row_batch_size
Batch size when reading ORC stripes.
input_format_orc_skip_columns_with_unsupported_types_in_schema_inference
Skip columns with unsupported types while schema inference for format ORC
input_format_orc_use_fast_decoder
Use a faster ORC decoder implementation.
input_format_parquet_allow_geoparquet_parser
Use geo column parser to convert Array(UInt8) into Point/Linestring/Polygon/MultiLineString/MultiPolygon types
input_format_parquet_allow_missing_columns
Allow missing columns while reading Parquet input formats
input_format_parquet_bloom_filter_push_down
When reading Parquet files, skip whole row groups based on the WHERE expressions and bloom filter in the Parquet metadata.
input_format_parquet_case_insensitive_column_matching
Ignore case when matching Parquet columns with CH columns.
input_format_parquet_enable_json_parsing
When reading Parquet files, parse JSON columns as ClickHouse JSON Column.
input_format_parquet_enable_row_group_prefetch
Enable row group prefetching during parquet parsing. Currently, only single-threaded parsing can prefetch.
input_format_parquet_filter_push_down
When reading Parquet files, skip whole row groups based on the WHERE/PREWHERE expressions and min/max statistics in the Parquet metadata.
input_format_parquet_local_file_min_bytes_for_seek
Min bytes required for local read (file) to do seek, instead of read with ignore in Parquet input format
input_format_parquet_max_block_size
Max block size for parquet reader.
input_format_parquet_memory_high_watermark
Approximate memory limit for Parquet reader v3. Limits how many row groups or columns can be read in parallel. When reading multiple files in one query, the limit is on total memory usage across those files.
input_format_parquet_memory_low_watermark
Schedule prefetches more aggressively if memory usage is below than threshold. Potentially useful e.g. if there are many small bloom filters to read over network.
input_format_parquet_page_filter_push_down
Skip pages using min/max values from column index.
input_format_parquet_prefer_block_bytes
Average block bytes output by parquet reader
input_format_parquet_preserve_order
Avoid reordering rows when reading from Parquet files. Not recommended as row ordering is generally not guaranteed, and other parts of query pipeline may break it. Use ORDER BY _row_number instead.
input_format_parquet_skip_columns_with_unsupported_types_in_schema_inference
Skip columns with unsupported types while schema inference for format Parquet
input_format_parquet_use_native_reader
Use native parquet reader v1. It's relatively fast but unfinished. Deprecated.
input_format_parquet_use_native_reader_v3 {#input_format_parquet_use_native_reader_v3} Experimental feature. Learn more.
Use Parquet reader v3. Experimental.
input_format_parquet_use_offset_index
Minor tweak to how pages are read from parquet file when no page filtering is used.
input_format_protobuf_flatten_google_wrappers
Enable Google wrappers for regular non-nested columns, e.g. google.protobuf.StringValue 'str' for String column 'str'. For Nullable columns empty wrappers are recognized as defaults, and missing as nulls
input_format_protobuf_oneof_presence
Indicate which field of protobuf oneof was found by means of setting enum value in a special colum
input_format_protobuf_skip_fields_with_unsupported_types_in_schema_inference
Skip fields with unsupported types while schema inference for format Protobuf
input_format_record_errors_file_path
Path of the file used to record errors while reading text formats (CSV, TSV).
input_format_skip_unknown_fields
Enables or disables skipping insertion of extra data.
When writing data, ClickHouse throws an exception if input data contain columns that do not exist in the target table. If skipping is enabled, ClickHouse does not insert extra data and does not throw an exception.
Supported formats:
- JSONEachRow (and other JSON formats)
- BSONEachRow (and other JSON formats)
- TSKV
- All formats with suffixes WithNames/WithNamesAndTypes
- MySQLDump
- Native
Possible values:
- 0 — Disabled.
- 1 — Enabled.
input_format_try_infer_dates
If enabled, ClickHouse will try to infer type Date from string fields in schema inference for text formats. If all fields from a column in input data were successfully parsed as dates, the result type will be Date, if at least one field was not parsed as date, the result type will be String.
Enabled by default.
input_format_try_infer_datetimes
If enabled, ClickHouse will try to infer type DateTime64 from string fields in schema inference for text formats. If all fields from a column in input data were successfully parsed as datetimes, the result type will be DateTime64, if at least one field was not parsed as datetime, the result type will be String.
Enabled by default.
input_format_try_infer_datetimes_only_datetime64
When input_format_try_infer_datetimes is enabled, infer only DateTime64 but not DateTime types
input_format_try_infer_exponent_floats
Try to infer floats in exponential notation while schema inference in text formats (except JSON, where exponent numbers are always inferred)
input_format_try_infer_integers
If enabled, ClickHouse will try to infer integers instead of floats in schema inference for text formats. If all numbers in the column from input data are integers, the result type will be Int64, if at least one number is float, the result type will be Float64.
Enabled by default.
input_format_try_infer_variants
If enabled, ClickHouse will try to infer type Variant in schema inference for text formats when there is more than one possible type for column/array elements.
Possible values:
- 0 — Disabled.
- 1 — Enabled.
input_format_tsv_allow_variable_number_of_columns
Ignore extra columns in TSV input (if file has more columns than expected) and treat missing fields in TSV input as default values
input_format_tsv_crlf_end_of_line
If it is set true, file function will read TSV format with \r\n instead of \n.
input_format_tsv_detect_header
Automatically detect header with names and types in TSV format
input_format_tsv_empty_as_default
Treat empty fields in TSV input as default values.
input_format_tsv_enum_as_number
Treat inserted enum values in TSV formats as enum indices.
input_format_tsv_skip_first_lines
Skip specified number of lines at the beginning of data in TSV format
input_format_tsv_skip_trailing_empty_lines
Skip trailing empty lines in TSV format
input_format_tsv_use_best_effort_in_schema_inference
Use some tweaks and heuristics to infer schema in TSV format
input_format_values_accurate_types_of_literals
For Values format: when parsing and interpreting expressions using template, check actual type of literal to avoid possible overflow and precision issues.
input_format_values_deduce_templates_of_expressions
For Values format: if the field could not be parsed by streaming parser, run SQL parser, deduce template of the SQL expression, try to parse all rows using template and then interpret expression for all rows.
input_format_values_interpret_expressions
For Values format: if the field could not be parsed by streaming parser, run SQL parser and try to interpret it as SQL expression.
input_format_with_names_use_header
Enables or disables checking the column order when inserting data.
To improve insert performance, we recommend disabling this check if you are sure that the column order of the input data is the same as in the target table.
Supported formats:
- CSVWithNames
- CSVWithNamesAndTypes
- TabSeparatedWithNames
- TabSeparatedWithNamesAndTypes
- JSONCompactEachRowWithNames
- JSONCompactEachRowWithNamesAndTypes
- JSONCompactStringsEachRowWithNames
- JSONCompactStringsEachRowWithNamesAndTypes
- RowBinaryWithNames
- RowBinaryWithNamesAndTypes
- CustomSeparatedWithNames
- CustomSeparatedWithNamesAndTypes
Possible values:
- 0 — Disabled.
- 1 — Enabled.
input_format_with_types_use_header
Controls whether format parser should check if data types from the input data match data types from the target table.
Supported formats:
- CSVWithNamesAndTypes
- TabSeparatedWithNamesAndTypes
- JSONCompactEachRowWithNamesAndTypes
- JSONCompactStringsEachRowWithNamesAndTypes
- RowBinaryWithNamesAndTypes
- CustomSeparatedWithNamesAndTypes
Possible values:
- 0 — Disabled.
- 1 — Enabled.
insert_distributed_one_random_shard
Enables or disables random shard insertion into a Distributed table when there is no distributed key.
By default, when inserting data into a Distributed table with more than one shard, the ClickHouse server will reject any insertion request if there is no distributed key. When insert_distributed_one_random_shard = 1, insertions are allowed and data is forwarded randomly among all shards.
Possible values:
- 0 — Insertion is rejected if there are multiple shards and no distributed key is given.
- 1 — Insertion is done randomly among all available shards when no distributed key is given.
interval_output_format
Allows choosing different output formats of the text representation of interval types.
Possible values:
- 
kusto- KQL-style output format.ClickHouse outputs intervals in KQL format. For example, toIntervalDay(2)would be formatted as2.00:00:00. Please note that for interval types of varying length (ie.IntervalMonthandIntervalYear) the average number of seconds per interval is taken into account.
- 
numeric- Numeric output format.ClickHouse outputs intervals as their underlying numeric representation. For example, toIntervalDay(2)would be formatted as2.
See also:
json_type_escape_dots_in_keys
When enabled, dots in JSON keys will be escaped during parsing.
output_format_arrow_compression_method
Compression method for Arrow output format. Supported codecs: lz4_frame, zstd, none (uncompressed)
output_format_arrow_fixed_string_as_fixed_byte_array
Use Arrow FIXED_SIZE_BINARY type instead of Binary for FixedString columns.
output_format_arrow_low_cardinality_as_dictionary
Enable output LowCardinality type as Dictionary Arrow type
output_format_arrow_string_as_string
Use Arrow String type instead of Binary for String columns
output_format_arrow_use_64_bit_indexes_for_dictionary
Always use 64 bit integers for dictionary indexes in Arrow format
output_format_arrow_use_signed_indexes_for_dictionary
Use signed integers for dictionary indexes in Arrow format
output_format_avro_codec
Compression codec used for output. Possible values: 'null', 'deflate', 'snappy', 'zstd'.
output_format_avro_rows_in_file
Max rows in a file (if permitted by storage)
output_format_avro_string_column_pattern
For Avro format: regexp of String columns to select as AVRO string.
output_format_avro_sync_interval
Sync interval in bytes.
output_format_binary_encode_types_in_binary_format
Write data types in binary format instead of type names in RowBinaryWithNamesAndTypes output format
output_format_binary_write_json_as_string
Write values of JSON data type as JSON String values in RowBinary output format.
output_format_bson_string_as_string
Use BSON String type instead of Binary for String columns.
output_format_csv_crlf_end_of_line
If it is set true, end of line in CSV format will be \r\n instead of \n.
output_format_csv_serialize_tuple_into_separate_columns
If it set to true, then Tuples in CSV format are serialized as separate columns (that is, their nesting in the tuple is lost)
output_format_decimal_trailing_zeros
Output trailing zeros when printing Decimal values. E.g. 1.230000 instead of 1.23.
Disabled by default.
output_format_json_array_of_rows
Enables the ability to output all rows as a JSON array in the JSONEachRow format.
Possible values:
- 1 — ClickHouse outputs all rows as an array, each row in the JSONEachRowformat.
- 0 — ClickHouse outputs each row separately in the JSONEachRowformat.
Example of a query with the enabled setting
Query:
Result:
Example of a query with the disabled setting
Query:
Result:
output_format_json_escape_forward_slashes
Controls escaping forward slashes for string outputs in JSON output format. This is intended for compatibility with JavaScript. Don't confuse with backslashes that are always escaped.
Enabled by default.
output_format_json_map_as_array_of_tuples
Serialize maps columns as JSON arrays of tuples.
Disabled by default.
output_format_json_named_tuples_as_objects
Serialize named tuple columns as JSON objects.
Enabled by default.
output_format_json_pretty_print
This setting determines how nested structures such as Tuples, Maps, and Arrays are displayed within the data array when using the JSON output format.
For example, instead of output:
The output will be formatted as:
Enabled by default.
output_format_json_quote_64bit_floats
Controls quoting of 64-bit floats when they are output in JSON* formats.
Disabled by default.
output_format_json_quote_64bit_integers
Controls quoting of 64-bit or bigger integers (like UInt64 or Int128) when they are output in a JSON format.
Such integers are enclosed in quotes by default. This behavior is compatible with most JavaScript implementations.
Possible values:
- 0 — Integers are output without quotes.
- 1 — Integers are enclosed in quotes.
output_format_json_quote_decimals
Controls quoting of decimals in JSON output formats.
Disabled by default.
output_format_json_quote_denormals
Enables +nan, -nan, +inf, -inf outputs in JSON output format.
Possible values:
- 0 — Disabled.
- 1 — Enabled.
Example
Consider the following table account_orders:
When output_format_json_quote_denormals = 0, the query returns null values in output:
When output_format_json_quote_denormals = 1, the query returns:
output_format_json_skip_null_value_in_named_tuples
Skip key value pairs with null value when serialize named tuple columns as JSON objects. It is only valid when output_format_json_named_tuples_as_objects is true.
output_format_json_validate_utf8
Controls validation of UTF-8 sequences in JSON output formats, doesn't impact formats JSON/JSONCompact/JSONColumnsWithMetadata, they always validate UTF-8.
Disabled by default.
output_format_markdown_escape_special_characters
When enabled, escape special characters in Markdown.
Common Mark defines the following special characters that can be escaped by :
Possible values:
- 0 — Disable.
- 1 — Enable.
output_format_msgpack_uuid_representation
The way how to output UUID in MsgPack format.
output_format_native_encode_types_in_binary_format
Write data types in binary format instead of type names in Native output format
output_format_native_use_flattened_dynamic_and_json_serialization
Write data of JSON and Dynamic columns in a flattened format (all types/paths as separate subcolumns).
output_format_native_write_json_as_string
Write data of JSON column as String column containing JSON strings instead of default native JSON serialization.
output_format_orc_compression_block_size
The size of the compression block in bytes for ORC output format.
output_format_orc_compression_method
Compression method for ORC output format. Supported codecs: lz4, snappy, zlib, zstd, none (uncompressed)
output_format_orc_dictionary_key_size_threshold
For a string column in ORC output format, if the number of distinct values is greater than this fraction of the total number of non-null rows, turn off dictionary encoding. Otherwise dictionary encoding is enabled
output_format_orc_row_index_stride
Target row index stride in ORC output format
output_format_orc_string_as_string
Use ORC String type instead of Binary for String columns
output_format_orc_writer_time_zone_name
The time zone name for ORC writer, the default ORC writer's time zone is GMT.
output_format_parquet_batch_size
Check page size every this many rows. Consider decreasing if you have columns with average values size above a few KBs.
output_format_parquet_bloom_filter_bits_per_value
Approximate number of bits to use for each distinct value in parquet bloom filters. Estimated false positive rates:
- 6 bits - 10%
- 10.5 bits - 1%
- 16.9 bits - 0.1%
- 26.4 bits - 0.01%
- 41 bits - 0.001%
output_format_parquet_bloom_filter_flush_threshold_bytes
Where in the parquet file to place the bloom filters. Bloom filters will be written in groups of approximately this size. In particular:
- if 0, each row group's bloom filters are written immediately after the row group,
- if greater than the total size of all bloom filters, bloom filters for all row groups will be accumulated in memory, then written together near the end of the file,
- otherwise, bloom filters will be accumulated in memory and written out whenever their total size goes above this value.
output_format_parquet_compliant_nested_types
In parquet file schema, use name 'element' instead of 'item' for list elements. This is a historical artifact of Arrow library implementation. Generally increases compatibility, except perhaps with some old versions of Arrow.
output_format_parquet_compression_method
Compression method for Parquet output format. Supported codecs: snappy, lz4, brotli, zstd, gzip, none (uncompressed)
output_format_parquet_data_page_size
Target page size in bytes, before compression.
output_format_parquet_date_as_uint16
Write Date values as plain 16-bit numbers (read back as UInt16), instead of converting to a 32-bit parquet DATE type (read back as Date32).
output_format_parquet_datetime_as_uint32
Write DateTime values as raw unix timestamp (read back as UInt32), instead of converting to milliseconds (read back as DateTime64(3)).
output_format_parquet_enum_as_byte_array
Write enum using parquet physical type: BYTE_ARRAY and logical type: ENUM
output_format_parquet_fixed_string_as_fixed_byte_array
Use Parquet FIXED_LENGTH_BYTE_ARRAY type instead of Binary for FixedString columns.
output_format_parquet_geometadata
Allow to write information about geo columns in parquet metadata and encode columns in WKB format.
output_format_parquet_max_dictionary_size
If dictionary size grows bigger than this many bytes, switch to encoding without dictionary. Set to 0 to disable dictionary encoding.
output_format_parquet_parallel_encoding
Do Parquet encoding in multiple threads. Requires output_format_parquet_use_custom_encoder.
output_format_parquet_row_group_size
Target row group size in rows.
output_format_parquet_row_group_size_bytes
Target row group size in bytes, before compression.
output_format_parquet_string_as_string
Use Parquet String type instead of Binary for String columns.
output_format_parquet_use_custom_encoder
Use a faster Parquet encoder implementation.
output_format_parquet_version
Parquet format version for output format. Supported versions: 1.0, 2.4, 2.6 and 2.latest (default)
output_format_parquet_write_bloom_filter
Write bloom filters in parquet files. Requires output_format_parquet_use_custom_encoder = true.
output_format_parquet_write_page_index
Write column index and offset index (i.e. statistics about each data page, which may be used for filter pushdown on read) into parquet files.
output_format_pretty_color
Use ANSI escape sequences in Pretty formats. 0 - disabled, 1 - enabled, 'auto' - enabled if a terminal.
output_format_pretty_display_footer_column_names
Display column names in the footer if there are many table rows.
Possible values:
- 0 — No column names are displayed in the footer.
- 1 — Column names are displayed in the footer if row count is greater than or equal to the threshold value set by output_format_pretty_display_footer_column_names_min_rows (50 by default).
Example
Query:
Result:
output_format_pretty_display_footer_column_names_min_rows
Sets the minimum number of rows for which a footer with column names will be displayed if setting output_format_pretty_display_footer_column_names is enabled.
output_format_pretty_fallback_to_vertical
If enabled, and the table is wide but short, the Pretty format will output it as the Vertical format does.
See output_format_pretty_fallback_to_vertical_max_rows_per_chunk and output_format_pretty_fallback_to_vertical_min_table_width for detailed tuning of this behavior.
output_format_pretty_fallback_to_vertical_max_rows_per_chunk
The fallback to Vertical format (see output_format_pretty_fallback_to_vertical) will be activated only if the number of records in a chunk is not more than the specified value.
output_format_pretty_fallback_to_vertical_min_columns
The fallback to Vertical format (see output_format_pretty_fallback_to_vertical) will be activated only if the number of columns is greater than the specified value.
output_format_pretty_fallback_to_vertical_min_table_width
The fallback to Vertical format (see output_format_pretty_fallback_to_vertical) will be activated only if the sum of lengths of columns in a table is at least the specified value, or if at least one value contains a newline character.
output_format_pretty_glue_chunks
If the data rendered in Pretty formats arrived in multiple chunks, even after a delay, but the next chunk has the same column widths as the previous, use ANSI escape sequences to move back to the previous line and overwrite the footer of the previous chunk to continue it with the data of the new chunk. This makes the result more visually pleasant.
0 - disabled, 1 - enabled, 'auto' - enabled if a terminal.
output_format_pretty_grid_charset
Charset for printing grid borders. Available charsets: ASCII, UTF-8 (default one).
output_format_pretty_highlight_digit_groups
If enabled and if output is a terminal, highlight every digit corresponding to the number of thousands, millions, etc. with underline.
output_format_pretty_highlight_trailing_spaces
If enabled and if output is a terminal, highlight trailing spaces with a gray color and underline.
output_format_pretty_max_column_name_width_cut_to
If the column name is too long, cut it to this length.
The column will be cut if it is longer than output_format_pretty_max_column_name_width_cut_to plus output_format_pretty_max_column_name_width_min_chars_to_cut.
output_format_pretty_max_column_name_width_min_chars_to_cut
Minimum characters to cut if the column name is too long.
The column will be cut if it is longer than output_format_pretty_max_column_name_width_cut_to plus output_format_pretty_max_column_name_width_min_chars_to_cut.
output_format_pretty_max_column_pad_width
Maximum width to pad all values in a column in Pretty formats.
output_format_pretty_max_rows
Rows limit for Pretty formats.
output_format_pretty_max_value_width
Maximum width of value to display in Pretty formats. If greater - it will be cut. The value 0 means - never cut.
output_format_pretty_max_value_width_apply_for_single_value
Only cut values (see the output_format_pretty_max_value_width setting) when it is not a single value in a block. Otherwise output it entirely, which is useful for the SHOW CREATE TABLE query.
output_format_pretty_multiline_fields
If enabled, Pretty formats will render multi-line fields inside table cell, so the table's outline will be preserved. If not, they will be rendered as is, potentially deforming the table (one upside of keeping it off is that copy-pasting multi-line values will be easier).
output_format_pretty_row_numbers
Add row numbers before each row for pretty output format
output_format_pretty_single_large_number_tip_threshold
Print a readable number tip on the right side of the table if the block consists of a single number which exceeds this value (except 0)
output_format_pretty_squash_consecutive_ms
Wait for the next block for up to specified number of milliseconds and squash it to the previous before writing. This avoids frequent output of too small blocks, but still allows to display data in a streaming fashion.
output_format_pretty_squash_max_wait_ms
Output the pending block in pretty formats if more than the specified number of milliseconds has passed since the previous output.
output_format_protobuf_nullables_with_google_wrappers
When serializing Nullable columns with Google wrappers, serialize default values as empty wrappers. If turned off, default and null values are not serialized
output_format_schema
The path to the file where the automatically generated schema will be saved in Cap'n Proto or Protobuf formats.
output_format_sql_insert_include_column_names
Include column names in INSERT query
output_format_sql_insert_max_batch_size
The maximum number of rows in one INSERT statement.
output_format_sql_insert_quote_names
Quote column names with '`' characters
output_format_sql_insert_table_name
The name of table in the output INSERT query
output_format_sql_insert_use_replace
Use REPLACE statement instead of INSERT
output_format_tsv_crlf_end_of_line
If it is set true, end of line in TSV format will be \r\n instead of \n.
output_format_values_escape_quote_with_quote
If true escape ' with '', otherwise quoted with \'
output_format_write_statistics
Write statistics about read rows, bytes, time elapsed in suitable output formats.
Enabled by default
precise_float_parsing
Prefer more precise (but slower) float parsing algorithm
regexp_dict_allow_hyperscan
Allow regexp_tree dictionary using Hyperscan library.
regexp_dict_flag_case_insensitive
Use case-insensitive matching for a regexp_tree dictionary. Can be overridden in individual expressions with (?i) and (?-i).
regexp_dict_flag_dotall
Allow '.' to match newline characters for a regexp_tree dictionary.
rows_before_aggregation
When enabled, ClickHouse will provide exact value for rows_before_aggregation statistic, represents the number of rows read before aggregatio
schema_inference_hints
The list of column names and types to use as hints in schema inference for formats without schema.
Example:
Query:
Result:
If the schema_inference_hints is not formatted properly, or if there is a typo or a wrong datatype, etc... the whole schema_inference_hints will be ignored.
schema_inference_make_columns_nullable
Controls making inferred types Nullable in schema inference.
Possible values:
- 0 - the inferred type will never be Nullable(use input_format_null_as_default to control what do do with null values in this case),
- 1 - all inferred types will be Nullable,
- 2 or auto- the inferred type will beNullableonly if the column containsNULLin a sample that is parsed during schema inference or file metadata contains information about column nullability,
- 3 - the inferred type nullability will match file metadata if the format has it (e.g. Parquet), always Nullable otherwise (e.g. CSV).
schema_inference_make_json_columns_nullable
Controls making inferred JSON types Nullable in schema inference.
If this setting is enabled together with schema_inference_make_columns_nullable, inferred JSON type will be Nullable.
schema_inference_mode
Mode of schema inference. 'default' - assume that all files have the same schema and schema can be inferred from any file, 'union' - files can have different schemas and the resulting schema should be the a union of schemas of all files
show_create_query_identifier_quoting_rule
Set the quoting rule for identifiers in SHOW CREATE query
show_create_query_identifier_quoting_style
Set the quoting style for identifiers in SHOW CREATE query
type_json_skip_duplicated_paths
When enabled, during parsing JSON object into JSON type duplicated paths will be ignored and only the first one will be inserted instead of an exceptio
validate_experimental_and_suspicious_types_inside_nested_types
Validate usage of experimental and suspicious types inside nested types like Array/Map/Tuple
