If you want to specify a set of performance counters to enable when profiling from the command line, you need to pass the name of a configuration file to the –counterfile option. You can generate a counter configuration file from within the Visual Studio client by using the “Save Counters” button on the Counter Selection page of the APP Profiler Settings dialog. The format of this configuration file is one counter name per line. An example of the contents of this file is given below.

 Wavefronts
 ALUInsts
 FetchInsts
 WriteInsts
 ALUBusy
 ALUFetchRatio
 ALUPacking
 FetchSize
 CacheHit
 FetchUnitBusy
 FetchUnitStalled
 WriteUnitStalled
 FastPath
 CompletePath
 PathUtilization
 LDSFetchInsts
 LDSWriteInsts
 LDSBankConflict

Format of API rules configuration file (argument passed to –apirulesfile)

If you want to specify a set of rules to use when generating the summary pages from a trace file when using the command line, you need to pass the name of a configuration file to the –apirulesfile option. The format of this file is one rule per line in the NAME=VALUE format. An example of the contents of this file is given below. Note that the “VALUE” can be either “True” or “False”.

 APITrace.APIRules.RefTracker=True
 APITrace.APIRules.BlockingWrite=False
 APITrace.APIRules.BadWorkGroupSize=True
 APITrace.APIRules.RetCodeAnalyzer=True
 APITrace.APIRules.DataTransferAnalyzer=True
 APITrace.APIRules.SyncAnalyzer=True
 APITrace.APIRules.DeprecatedFunctionAnalyzer=True

Format of API filter configuration file (argument passed to –apifilterfile)

If you want to specify a set of OpenCL™ APIs to ignore when collecting an API trace using the command line, you need to pass the name of a configuration file to the –apifilterfile option. The format of this file is one API name per line. An example of the contents of this file is given below.

 clGetPlatformIDs
 clGetPlatformInfo
 clGetDeviceIDs
 clGetDeviceInfo
 clGetContextInfo
 clGetCommandQueueInfo
 clGetSupportedImageFormats
 clGetMemObjectInfo
 clGetImageInfo
 clGetSamplerInfo
 clGetProgramInfo
 clGetProgramBuildInfo
 clGetKernelInfo
 clGetKernelWorkGroupInfo
 clGetEventInfo
 clGetEventProfilingInfo

Format of occupancy display configuration file (argument passed to –occupancydisplay)

In order to generate a Kernel Occupancy HTML display file using the command line, you need to pass the name of a configuration file to the –occupancydisplay option. The format of this configuration file is one parameter per line in the NAME=VALUE format. An example of the contents of this file is given below. The “VALUES” are taken from a generated .occupancy file for a particular kernel.

 ThreadID=3364
 CallIndex=101
 KernelName=reduce
 DeviceName=Capeverde
 ComputeUnits=10
 MaxWavesPerComputeUnit=40
 MaxWorkGroupPerComputeUnit=16
 MaxVGPRs=256
 MaxSGPRs=512
 MaxLDS=32768
 UsedVGPRs=11
 UsedSGPRs=20
 UsedLDS=4096
 WavefrontSize=64
 WorkGroupSize=256
 WavesPerWorkGroup=4
 MaxWorkGroupSize=256
 MaxWavesPerWorkGroup=4
 GlobalWorkSize=256
 MaxGlobalWorkSize=16777216
 WavesLimitedByVGPR=40
 WavesLimitedBySGPR=40
 WavesLimitedByLDS=32
 WavesLimitedByWorkgroup=40
 Occupancy=80