Discovery Guide > Introducing Discovery > About Indexing
 
About Indexing
By default, when you index a data source, all tables in the data source are fully indexed. But if time becomes an issue, here are some ways to deal with it:
If you have an especially large data source, you might want to index it at a time when there is the least amount of data source activity. You might also want to use sampling for large data sources. See About Data Sampling.
You can limit the scope of indexing and improve performance by excluding catalogs, schemas, or tables from the data source. See Modifying a Data Source Definition, for more information.
These factors affect how long the indexing and relationship discovery process might take:
The number of rows and columns in the data source.
The TDV machine and processor.
The maximum Java heap setting.
When Discovery creates the index, it saves it in the TDV installation directory. You can specify a different location of this directory. See Changing the Index Directory.