Using the Sample Chat Web Application

Prerequisites

Access the Web Application

  • Connect to the sample web application at http://<host-ip>:8090.

    Figure: Sample chat web application
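Before opening the browser, it can help to confirm that the web application responds on port 8090. The following is a minimal sketch using only the Python standard library. The function names build_app_url and is_reachable are hypothetical helpers introduced for illustration and are not part of the sample application.

```python
from urllib.error import URLError
from urllib.request import urlopen

DEFAULT_PORT = 8090  # port used for the web application in this guide


def build_app_url(host_ip, port=DEFAULT_PORT):
    """Return the web application URL for the given host (hypothetical helper)."""
    return f"http://{host_ip}:{port}"


def is_reachable(url, timeout=5.0):
    """Return True if an HTTP GET to the URL succeeds with a status below 400."""
    try:
        with urlopen(url, timeout=timeout) as response:
            return response.status < 400
    except (URLError, OSError):
        # Connection refused, timeout, DNS failure, or an HTTP error status.
        return False
```

For example, is_reachable(build_app_url("<host-ip>")) with your host address substituted should return True once the deployment is running. A False result suggests checking that the containers are up and that the port is not blocked.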

Use Unstructured Documents as a Knowledge Base

  1. Optional: If you configured your deployment with NVIDIA Riva, select [X] Enable TTS output so that the web application reads the answers to your queries aloud.

    To use the voice-to-voice interaction capabilities, select the ASR language, the TTS language, and the TTS voice from the dropdown menus below the checkboxes. For this test, use English (en-US) for both the ASR language and the TTS language.

  2. On the Converse tab, enter “How many cores does the Grace superchip contain?” in the chat box and click Submit.

    Alternatively, click the microphone button to the right of the text box and ask the question verbally.

    Figure: Grace query failure

  3. Upload the sample data to the knowledge base.

    Click the Knowledge Base tab and then click Add File.

    Navigate to the dataset.zip file that is located in the notebooks directory. Unzip the archive and upload the PDFs.

  4. Return to the Converse tab and select [X] Use knowledge base.

  5. Reenter the question: “How many cores does the Grace superchip contain?”

    Figure: Grace query success

    Tip

    The default prompts are optimized for the Llama chat model. If you use a completion model, you must fine-tune the prompts.
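Step 3 asks you to unzip dataset.zip and upload the PDFs it contains. As a rough sketch, the extraction can be scripted with the Python standard library. The function name extract_pdfs and the output directory are assumptions made for illustration. The upload itself still happens through the Knowledge Base tab.

```python
import zipfile
from pathlib import Path


def extract_pdfs(archive_path, output_dir):
    """Extract only the PDF members of a zip archive and return their paths."""
    output = Path(output_dir)
    output.mkdir(parents=True, exist_ok=True)
    extracted = []
    with zipfile.ZipFile(archive_path) as archive:
        for member in archive.namelist():
            # Skip directories and any non-PDF files in the archive.
            if member.lower().endswith(".pdf"):
                extracted.append(Path(archive.extract(member, output)))
    return sorted(extracted)
```

Calling extract_pdfs("notebooks/dataset.zip", "dataset_pdfs") from the repository root, assuming that relative path matches your checkout, returns the list of extracted PDF paths that you can then select with Add File.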

Troubleshooting

If you receive the “Media devices could not be accessed” error message when you first attempt to transcribe a voice query, perform the following steps.

Figure: Media device access error window

  1. Open another browser tab and enter chrome://flags in the location field.

  2. Enter “insecure origins treated as secure” in the search field.

    Figure: Browser viewing the chrome://flags URL

  3. Enter http://<host-ip>:8090 in the text box and select Enabled from the menu.

  4. Click Relaunch.

  5. After the browser relaunches, grant http://<host-ip>:8090 access to your microphone.

  6. Retry your request.
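As an alternative to editing chrome://flags by hand, Chromium-based browsers also accept the same setting as a command-line switch, --unsafely-treat-insecure-origin-as-secure, which is generally used together with --user-data-dir pointing at a separate profile. The sketch below only builds the argument list. The function name insecure_origin_args and the profile directory are assumptions for illustration, and the browser executable path depends on your system.

```python
def insecure_origin_args(host_ip, profile_dir, port=8090):
    """Build Chrome switches that treat the web application origin as secure."""
    origin = f"http://{host_ip}:{port}"
    return [
        # Same effect as the "Insecure origins treated as secure" flag.
        f"--unsafely-treat-insecure-origin-as-secure={origin}",
        # A dedicated profile directory (assumed path supplied by the caller).
        f"--user-data-dir={profile_dir}",
    ]
```

These arguments can be appended to the Chrome executable of your choice, for example through subprocess.run. The browser still prompts for microphone access the first time you use voice input.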