Guides
Log In
Guides

Inference Server CLI

The NuPIC Inference Server is started using the nupic/nupic_inference.sh script.

To use this script, make sure it is made executable using chmod +x nupic/nupic_inference.sh, then use the following syntax:

./nupic_inference.sh start [OPTIONS] | stop

Commands

CommandDescription
startStarts a new NuPIC Inference Server along with monitoring tools (Grafana, Prometheus).
stopStops and removes the running NuPIC Inference Server and monitoring tools.

Options

OptionValueDescription
--cpusCPU indicesDefines the explicit CPUs which NuPIC Inference Server will run on. If not specified, NuPIC Inference Server runs on all available CPU cores.

This can be useful when running NuPIC Inference Server concurrently with other workloads on the same machine.
--exposeNoneExpose NuPIC Inference Server's HTTP port (8000), GRPC" port (8001), and metrics port (8002) to all hosts. If not specified, all these ports are exposed on the loopback interface.

This is required to allow remote clients to make requests to the Inference Server.
--gpusNoneAllow NuPIC Inference Server to use all GPU devices. If not specified, NuPIC Inference Server runs with CPU only.

Note that each model will still have to be individual configured to use the GPU using [instance_group.kind](doc:configuration#instance-group) on the respective config.pbtxt.
--log-levelIntegerSet custom log verbosity for NuPIC Inference Server. The level is an integer value. 0 means verbose logging is disabled. 1 and 2 provides different level of verbose loggings. Level 2 enables most detailed logging. If not specified, the default value is 0.
--shm-sizeBytesSet custom shared memory size (/dev/shm) for NuPIC Inference Server. If not specified, the default shared memory size is 512 MB (512m).
--enable-cachingNoneEnable response caching in NuPIC Inference Server. If not specified, caching is disabled.
--cache-sizeBytesSet custom cache size for NuPIC Inference Server. If not specified, the default cache size is 1GB.
--additional_argsVariousAny additional arguments to pass to Triton Server; e.g., NUMA node configurations.