diff --git a/admin_manual/ai/app_assistant.rst b/admin_manual/ai/app_assistant.rst index 89796d6f16b..a5e5656e92a 100644 --- a/admin_manual/ai/app_assistant.rst +++ b/admin_manual/ai/app_assistant.rst @@ -111,6 +111,13 @@ In order to make use of our AI agent feature, offering the execution of actions You will also need a text processing provider as specified above (ie. *llm2* or *integration_openai*). +Text-To-Speech +~~~~~~~~~~~~~~ + +In order to make use of Text-To-Speech, you will need an app that provides a Text-To-Speech backend: + +* *integration_openai* - Integrates with the OpenAI API to provide AI functionality from OpenAI servers (Customer support available upon request; see :ref:`AI as a Service`) + Configuration ------------- diff --git a/admin_manual/ai/overview.rst b/admin_manual/ai/overview.rst index cf1e2be875d..2cf812f0dc8 100644 --- a/admin_manual/ai/overview.rst +++ b/admin_manual/ai/overview.rst @@ -66,6 +66,7 @@ Nextcloud uses modularity to separate raw AI functionality from the Graphical Us "Context Chat","`Nextcloud Assistant Context Chat `_","Yellow","Yes","Yes","No","Yes" "","`Nextcloud Assistant Context Chat (Backend) `_","Yellow","Yes","Yes","No","Yes" "Context Agent","`Nextcloud Context Agent `_","Green","Yes","Yes","Yes","Yes" + "Text To Speech","`Open AI Text To Speech `_","Red","No","No","No","No" Ethical AI Rating diff --git a/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_32.rst b/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_32.rst index d960f5bb1f3..a73b0d8ee02 100644 --- a/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_32.rst +++ b/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_32.rst @@ -36,7 +36,7 @@ Back-end changes Added APIs ^^^^^^^^^^ -- TBD +- New task processing task type ``OCP\TaskProcessing\TextToSpeech`` to convert text to speech. Changed APIs ^^^^^^^^^^^^ diff --git a/developer_manual/digging_deeper/task_processing.rst b/developer_manual/digging_deeper/task_processing.rst index d6267dcba96..a52395ac782 100644 --- a/developer_manual/digging_deeper/task_processing.rst +++ b/developer_manual/digging_deeper/task_processing.rst @@ -116,6 +116,11 @@ The following built-in task types are available: * ``input``: ``Text`` * Output shape: * ``output``: ``Text`` + * ``'core:text2speech'``: This task type is for generating speech from text prompts. It is implemented by ``\OCP\TaskProcessing\TaskTypes\TextToSpeech`` + * Input shape: + * ``input``: ``Text`` + * Output shape: + * ``speech``: ``Audio`` Task types can be disabled in the AI admin settings so they are not available for the Assistant or other apps even if they are implemented. All implemented Task types are enabled by default.