The elapsed time for servicing a unit of application work. This is the sum of the path length cost and any time spent blocked.
If the latency is larger than the required target then it can be reduced by one or more of the following items:
Decrease the amount of path length work (simplify or optimize the application).
Caching data in memory instead of retrieving it from an external resource (e.g. Disk or network).
Increasing the performance of resources which are waited for (e.g. Faster disk or network).
Increasing processor clock rate.