-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Switch to TF 2.0 and new NLU components #5266
Merged
Changes from 250 commits
Commits
Show all changes
829 commits
Select commit
Hold shift + click to select a range
9cdea3a
fix test pipelines
Ghostvv 22775e4
black
Ghostvv 5e94e13
Merge branch 'tf2' into tf2-val
Ghostvv f03c204
Merge branch 'tf2-val' into tf2-old-crf
Ghostvv 99c8cdd
reuse existing methods
tabergma a175f3c
update docs
tabergma 2623866
black formatting
tabergma 2843007
Merge branch 'master' into tf2
tabergma 9a34f59
fix pbar in convertfeaturizer
Ghostvv 176927c
merge tf2
Ghostvv 3952704
raise deprecation warning
tabergma 0f489df
Merge branch 'tf2' into tf2-val
tabergma 2fa3106
fix test
tabergma ebe604f
Merge branch 'tf2-val' into tf2-old-crf
tabergma 83e450a
Merge branch 'tf2' into tf2-val
tabergma 3e4b79f
Merge pull request #5259 from RasaHQ/tf2-val
tabergma 792024d
Merge branch 'tf2' into tf2-old-crf
tabergma 79451db
add bias feature again
tabergma d11a4eb
added changelog entries
tabergma 828b795
review comments
tabergma b41ea6b
add missing comma
tabergma 2dd3841
refactored model loading for convert
dakshvar22 b47a45d
add test for docker configs
tabergma 1646fab
fix docs
tabergma 8109eed
added class descriptions
dakshvar22 b2f70bd
Merge branch 'tf2' of github.com:RasaHQ/rasa into tf2
dakshvar22 2df9c9a
remove colon
Ghostvv b51c020
suppress logging statement for tensorflow version from transformers
dakshvar22 43eb701
Merge branch 'tf2' of github.com:RasaHQ/rasa into tf2
dakshvar22 0fb940d
use configs from files in docs
tabergma 8199fcf
review comments
dakshvar22 95c92ab
Merge branch 'tf2' of github.com:RasaHQ/rasa into tf2
dakshvar22 f297015
add links to pipeline docs
dakshvar22 9822b31
added language model specific info to docs
dakshvar22 5fb1e09
fix typo
dakshvar22 0d39d24
Merge pull request #5267 from RasaHQ/tf2-old-crf
Ghostvv 357f2c4
make sparsity configurable, Response selector is a subclass of diet s…
Ghostvv b6d2667
rename constant
Ghostvv d9f624a
Merge branch 'tf2' into tf2-params
Ghostvv b74c7d1
fix duplicate link
dakshvar22 e2caee0
use self.epochs to set current epoch
Ghostvv c976905
Merge pull request #5273 from RasaHQ/pipeline-docs
Ghostvv a75e70c
Merge branch 'tf2' into tf2-params
Ghostvv ccca6ac
update links in changelogs
tabergma ccf3789
remove diet selector
Ghostvv 2444e01
add changelog for removed mitie docker image
tabergma d2426f9
add weight sparsity to the docs
Ghostvv c4871b4
remove doc markers
Ghostvv 672ea5b
review comments on docs
tabergma f67b634
made transformers lib optional and removed a few other deps
dakshvar22 0223cf6
made transformers lib optional and removed a few other deps
dakshvar22 fff220a
merge conflict
dakshvar22 f1855b8
review comments on docs
tabergma a45468a
Update docs/nlu/components.rst
dakshvar22 b82ff9b
fix link in docs
Ghostvv 2e28cc2
remove diet selector
Ghostvv f62c65b
remove doc markers
Ghostvv a8870e3
fix link in docs
Ghostvv e63051f
Merge branch 'tf2-selector' of https://github.com/RasaHQ/rasa into tf…
Ghostvv dbb6635
review comments
tabergma a06cff4
fix imports in tests/utitlities.py
tabergma 1899f44
merge tf2
Ghostvv 1a8b820
use json.dump and json.load in lexical syntactic featurizer
tabergma 140dba9
retrieval_intent is now a constant
tabergma 181c7a1
merge tf2
Ghostvv 5602304
made transformers lib optional and removed a few other deps
dakshvar22 5621809
Update docs/nlu/components.rst
dakshvar22 5dc7fba
Merge branch 'transformers-pipeline' of github.com:RasaHQ/rasa into t…
dakshvar22 41266a0
renaming functions
tabergma 3cfd243
droprate -> drop rate
tabergma 6378522
bump tensorflow text to use latest versions
dakshvar22 c8c3c3b
fixing persisting lexical syntactic featurizer
tabergma ab16d5d
Merge branch 'tf2' into transformers-pipeline
dakshvar22 f555d09
Merge branch 'tf2' into bump_tensorflow_text
dakshvar22 7d61e4f
merge tf2-params
Ghostvv dc1b55f
Merge pull request #5275 from RasaHQ/tf2-selector
Ghostvv 14dfdf5
improve docstrings of components
tabergma f1f6e77
Merge branch 'master' into tf2
tabergma d6b7612
merge tf2
Ghostvv 45a11a1
update docs on model options
tabergma 179ad4f
revert back to old requirements
dakshvar22 364d96a
fix merge conflict
dakshvar22 b6b1ff9
fix persisting and loading of ted policy
tabergma 8ccbb39
Merge branch 'tf2' into bump_tensorflow_text
dakshvar22 6b4e7b6
removed unnecessary deps again
dakshvar22 281e14a
Merge branch 'tf2' into transformers-pipeline
dakshvar22 7a74f98
remove flask from test
tabergma f98d53e
Merge branch 'tf2' into transformers-pipeline
dakshvar22 ff0ae98
Merge pull request #5276 from RasaHQ/transformers-pipeline
dakshvar22 fadb74f
Merge branch 'tf2' into bump_tensorflow_text
dakshvar22 e9f5a2d
don't import raise_warning directly
tabergma 8d57ba8
Merge branch 'master' into tf2
tabergma a460e5a
Merge pull request #5278 from RasaHQ/bump_tensorflow_text
dakshvar22 abcdc2e
Merge branch 'tf2' into tf2-sparsitz
tabergma 7589027
review comment
tabergma 72cc021
add missing masked_lm option to response selector
tabergma 1c09550
Use ResponseSelector instead of DIETSelector
tabergma c2eef82
Merge pull request #5282 from RasaHQ/tf2-sparsitz
tabergma 3200732
clean up NLU tests
tabergma 8f89857
Merge branch 'tf2' into tf2-tests
tabergma 6a205ac
update diet classifier test
tabergma da4b6e2
Merge branch 'master' into tf2
tabergma 9ed77a3
Merge branch 'tf2' into tf2-tests
tabergma 5c8ed35
clean up
tabergma 33015fd
update example configs
tabergma 63f5f69
reduce number of train epochs
tabergma eb81127
fix random seed test
tabergma f1cc9a7
raise exception instead of NotImplemented
tabergma 989f5fd
added mitie docker image again
tabergma 95f5fb5
clean up imports
tabergma 446ff97
update config path in docker file
tabergma 39aeed0
make comment start from capital S
Ghostvv dd6f1c8
refactor updating EVAL_NUM_EPOCHS
tabergma 90c1203
fix tests
tabergma 4d6eb7e
move pickle dump and load to io utils
tabergma 64ff5ca
review comments
tabergma 16466ef
review comments
tabergma 765939f
use jsonpickle instead of pickle
tabergma cf84917
Merge branch 'tf2' into tf2-tests
tabergma 8eb283f
fix types
tabergma cbe6f10
Merge branch 'tf2' into tf2-tests
tabergma d5ac30e
print warning on epochs not set.
tabergma b45e1f4
deprecate provides and requires in nlu
Ghostvv bb9bf49
Merge branch 'tf2' into tf2-required
Ghostvv 6e76414
fix entity extractor import
Ghostvv 9d07852
fix loading TED policy
tabergma 3488f7f
Merge branch 'master' into tf2
tabergma 064e946
Merge branch 'tf2' into tf2-tests
tabergma 5bb9b7f
Merge branch 'tf2' into tf2-required
tabergma 4f8a7ad
check if tag id dict exists.
tabergma 9fa9bfd
Merge branch 'tf2' into tf2-tests
tabergma a9df694
Merge branch 'tf2' into tf2-required
tabergma d12ff0a
update docstrings in components.py
tabergma befeac5
add empty pipeline validation
Ghostvv 0329a43
merge tf2
Ghostvv 4dd23d0
fix refs in docstings
tabergma 287343c
change json_pickle to pickle_dump
Ghostvv dc932b7
remove all traces of component.required and provides
Ghostvv 9e3b51b
rename test
Ghostvv b78f5bc
merge tf2
Ghostvv 0c281bb
force_download of HF model weights
tabergma fe6b90a
add docstrings to Policy
Ghostvv d1ae222
Merge pull request #5305 from RasaHQ/tf2-docstrings
Ghostvv 92a90fa
fix test
Ghostvv 2dc9215
merge tf2
Ghostvv 6a12f02
review comments on docs
tabergma 3c06032
fix tests
Ghostvv bfcde0d
Update data/configs_for_docs/default_config.yml
Ghostvv ac8c2ec
Update data/configs_for_docs/default_english_config.yml
Ghostvv 8d0c3dd
Update data/configs_for_docs/default_spacy_config.yml
Ghostvv 877e52f
Update data/configs_for_docs/pretrained_embeddings_convert_config_1.yml
Ghostvv 4874095
Update data/configs_for_docs/pretrained_embeddings_convert_config_2.yml
Ghostvv 0543f7a
Update data/configs_for_docs/pretrained_embeddings_spacy_config_1.yml
Ghostvv 628b07a
Update data/configs_for_docs/pretrained_embeddings_spacy_config_2.yml
Ghostvv d9f06c8
Update data/configs_for_docs/supervised_embeddings_config_1.yml
Ghostvv 5ae181d
Update data/configs_for_docs/supervised_embeddings_config_2.yml
Ghostvv 36a5be9
Update rasa/core/policies/keras_policy.py
Ghostvv c4d1eef
create DOCS_URL_MIGRATION_GUIDE
Ghostvv 1d7aead
update choosing a pipeline.
tabergma 350bc80
undo changes
tabergma aa7171b
refactor config checks
Ghostvv ecf939b
create removal changelog
Ghostvv 6c7f3f7
Merge pull request #5293 from RasaHQ/tf2-required
Ghostvv 1e77c69
Update rasa/nlu/classifiers/diet_classifier.py
Ghostvv 18f7d9a
Update rasa/nlu/classifiers/diet_classifier.py
Ghostvv db0d794
set num_tags to None in init
Ghostvv b02135c
substitute Any with Type[...]
Ghostvv 1831f46
Update rasa/nlu/classifiers/diet_classifier.py
Ghostvv 8728599
Update rasa/nlu/classifiers/diet_classifier.py
Ghostvv 6bf316f
Merge branch 'tf2' into tf2-tests
tabergma 3915a0b
add missing import
tabergma 84bced6
fix types
tabergma c7e2199
Merge pull request #5290 from RasaHQ/tf2-tests
tabergma 3f8fd72
Merge branch 'master' into tf2
tabergma 0ae0e58
fix incorrect import
tabergma 682455f
Merge branch 'tf2' into tf2-docs
tabergma 2b36f1e
Merge pull request #5306 from RasaHQ/tf2-docs
tabergma f914fbe
documentation review comments
tabergma 13985f5
documentation review comments
tabergma 78b32ca
documentation review comments
tabergma b758670
documentation review comments
tabergma 1cb7c53
substitute loss and sim strings with constants
Ghostvv 12c208b
fix doc warnings
tabergma 4504271
address rasa init problems
tabergma 2175f27
documentation review comments
tabergma 8e6e586
fix formatting error
akelad c22adff
update migration guide
tabergma 81caf9e
modify comments
Ghostvv 5892968
Merge branch 'tf2' of https://github.com/RasaHQ/rasa into tf2
Ghostvv 0ecff89
update choosing a pipeline
tabergma 9ee5cf8
add note for old terminology.
tabergma 7a865ab
undo docker changes
tabergma 9cc389d
refactor data helpers
Ghostvv 82117b2
Merge branch 'tf2' of https://github.com/RasaHQ/rasa into tf2
Ghostvv 8cfa6e5
substitute feature name strings with constants
Ghostvv 292134a
refactor layers preparation
Ghostvv 3057c1e
update components.rst
tabergma 85ff063
update choosing a pipeline
tabergma d92f689
quick fix for docs typos/formatting
akelad 830c66d
use migration guide constant
tabergma c0afb86
refactor loss and f1 helpers
Ghostvv 9688646
Merge branch 'tf2' of https://github.com/RasaHQ/rasa into tf2
Ghostvv 4de13a8
review comments on featurizers
tabergma cf96b61
fix docstrings in components
Ghostvv 6fce774
merge tf2
Ghostvv 935b90d
review comments on lexical_syntactic_featuirzer.
tabergma 4f657fb
review comments on convert
tabergma 208f5e4
review comments on hugging face components
tabergma b36a712
Merge branch 'master' into tf2
tabergma c5b337d
rename inverted tag and label dicts
Ghostvv cb9cd5a
Merge branch 'tf2' of https://github.com/RasaHQ/rasa into tf2
Ghostvv ea84dc9
remove _find_example_for_tag
Ghostvv dc2d5a9
remove setting numpy random seed in train
Ghostvv a306da7
review comments
tabergma be46495
create no entity tag constant
Ghostvv 54bd698
Merge branch 'tf2' of https://github.com/RasaHQ/rasa into tf2
Ghostvv bb2b6cb
add type to tf_layers
Ghostvv d1aa219
update constants comment
Ghostvv 12bdf87
remove magic numbers probs
Ghostvv 4eda2e5
fix type of Data in model data
Ghostvv f1f6c43
add axis=
Ghostvv 0542b28
add explanatory comments
Ghostvv 1e8b7b9
check if responses are present.
tabergma 937813d
review comments
tabergma 2886ea0
add comment and type
Ghostvv e2e5139
Merge branch 'tf2' of https://github.com/RasaHQ/rasa into tf2
Ghostvv 483713b
rename relative lengths
Ghostvv c6e6f27
remove batch_tuple_sizes
tabergma 502ef22
review comments
tabergma baab754
review comments
tabergma 1cb18d2
add docstring
tabergma ccf25a4
add comments to model data
Ghostvv d44181a
add comments to model_data
Ghostvv eb5cf6b
create tmp dir for convert
tabergma 67dad7a
update type
tabergma e0ab5f7
add comments
Ghostvv d58534b
Merge branch 'tf2' of https://github.com/RasaHQ/rasa into tf2
Ghostvv 4cf5fff
change comment
Ghostvv 4fefe76
recalculate number of examples after balancing
Ghostvv 8443d50
reorganize methods in model_data
Ghostvv 1c5f9da
remove num_neg check from ted
Ghostvv e06caaf
update requirements
Ghostvv 2854674
fix nlu comparison test
tabergma 554ad4c
update requirements
Ghostvv c7bbae1
Merge branch 'tf2' of https://github.com/RasaHQ/rasa into tf2
Ghostvv 727ed61
update version
Ghostvv c33e408
Update alt_requirements/requirements_pretrained_embeddings_convert.txt
Ghostvv 1956976
Fixed an issue with AWS persistor
7229a3e
Merge branch 'master' into tf2
Ghostvv 189355b
Merge branch 'master' into tf2
tmbo File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
4 changes: 2 additions & 2 deletions
4
alt_requirements/requirements_pretrained_embeddings_convert.txt
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,5 @@ | ||
# Minimum Install Requirements | ||
-r ../requirements.txt | ||
|
||
tensorflow_text==1.15.1 | ||
tensorflow_hub==0.6.0 | ||
tensorflow_text==2.1.0rc0 | ||
tensorflow_hub==0.7.0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,4 @@ | ||
Part of Slack sanitization: | ||
Multiple garbled URL's in a string coming from slack will be converted into actual strings. ``Example: health check of <http://eemdb.net|eemdb.net> and <http://eemdb1.net|eemdb1.net> to health check of eemdb.net and eemdb1.net`` | ||
Multiple garbled URL's in a string coming from slack will be converted into actual strings. | ||
``Example: health check of <http://eemdb.net|eemdb.net> and <http://eemdb1.net|eemdb1.net> to health check of | ||
eemdb.net and eemdb1.net`` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
Add :ref:`LexicalSyntacticFeaturizer` to sparse featurizers. | ||
|
||
``LexicalSyntacticFeaturizer`` does the same featurization as the ``CRFEntityExtractor``. We extracted the | ||
tabergma marked this conversation as resolved.
Show resolved
Hide resolved
|
||
featurization into a separate component so that the features can be reused and featurization is independent from the | ||
entity extraction. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,7 @@ | ||
Integrate language models from HuggingFace's `Transformers <https://github.com/huggingface/transformers>`_ Library. | ||
|
||
Add a new NLP component :ref:`HFTransformersNLP <HFTransformersNLP>` which | ||
tabergma marked this conversation as resolved.
Show resolved
Hide resolved
|
||
tokenizes and featurizes incoming messages using a specified pre-trained model with the Transformers library as the backend. | ||
Add ``LanguageModelTokenizers`` and ``LanguageModelFeaturizers`` which use the information from ``HFTransformersNLP`` | ||
tabergma marked this conversation as resolved.
Show resolved
Hide resolved
|
||
and sets them correctly for message object. | ||
Language models currently supported: BERT, OpenAIGPT, GPT-2, XLNet, DistilBert, RoBERTa |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
Refactor how GPU and CPU environments are configured for TensorFlow 2.0. | ||
|
||
Please refer to the :ref:`documentation <tensorflow_usage>` to understand | ||
which environment variables to set in what scenarios. A couple of examples are shown below as well: | ||
|
||
.. code-block:: python | ||
|
||
# This specifies to use 1024 MB of memory from GPU with logical ID 0 and 2048 MB of memory from GPU with logical ID 1 | ||
TF_GPU_MEMORY_ALLOC="0:1024, 1:2048" | ||
|
||
# Specifies that at most 3 CPU threads can be used to parallelize multiple non-blocking operations | ||
TF_INTER_OP_PARALLELISM_THREADS="3" | ||
|
||
# Specifies that at most 2 CPU threads can be used to parallelize a particular operation. | ||
TF_INTRA_OP_PARALLELISM_THREADS="2" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
Added a new NLU component ``DIETClassifier`` and a new policy ``TEDPolicy``. | ||
tabergma marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
DIET (Dual Intent and Entity Transformer) is a multi-task architecture for intent classification and entity | ||
recognition. You can read more about this component in our :ref:`documentation <diet-classifier>`. | ||
The new component will replace the ``EmbeddingIntentClassifier`` and the ``CRFEntityExtractor`` in the future. | ||
Those two components are deprecated from now on. | ||
See :ref:`migration guide <migration-to-rasa-1.8>` for details on how to | ||
switch to the new component. | ||
|
||
``TEDPolicy`` is the new name for ``EmbeddingPolicy``. ``EmbeddingPolicy`` is deprecated from now on. | ||
The functionality of ``TEDPolicy`` and ``EmbeddingPolicy`` is the same. Please update your configuration file | ||
to use the new name for the policy. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
We updated our code to TensorFlow 2. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,9 @@ | ||
We deprecated all existing pipeline templates, ``SklearnIntentClassifier`` and ``KerasPolicy``. | ||
tabergma marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
Please list the components you want to use directly in your configuration file. | ||
Check out :ref:`Choosing a Pipeline <choosing-a-pipeline>` to decide what components to | ||
include in your pipeline. | ||
|
||
Use ``DIETClassifier`` instead of ``SklearnIntentClassifier``. | ||
|
||
Use ``TEDPolicy`` instead of ``KerasPolicy``. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
The sentence vector of the ``SpacyFeaturizer`` and ``MitieFeaturizer`` can be calculated using max or mean pooling. | ||
|
||
To specify the pooling operation, set the option ``pooling`` for the ``SpacyFeaturizer`` or the ``MitieFeaturizer`` | ||
tabergma marked this conversation as resolved.
Show resolved
Hide resolved
|
||
in your configuration file. The default pooling operation is ``mean``. The mean pooling operation also does not take | ||
into account words, that do not have a word vector. | ||
See our :ref:`documentation <components>` for more details. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,3 @@ | ||
The `EmbeddingPolicy <https://rasa.com/docs/rasa/core/policies/#embedding-policy>`_ | ||
replaces the ``KerasPolicy`` in new Rasa projects generated with ``rasa init``. | ||
The `EmbeddingPolicy <https://rasa.com/docs/rasa/core/policies/#embedding-policy>`_ | ||
is now the recommended machine learning policy. Please see the | ||
`migration guide <https://rasa.com/docs/rasa/migration-guide/#rasa-1-7-to-rasa-1-8>`_ | ||
if you want to switch to this new policy in an existing project. | ||
The :ref:`TEDPolicy <ted_policy>` replaces the ``KerasPolicy`` in new Rasa projects generated with ``rasa init``. | ||
The :ref:`TEDPolicy <ted_policy>` is now the recommended machine learning policy. Please see the | ||
:ref:`migration guide <migration-to-rasa-1.8>` if you want to switch to this new policy in an existing project. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: WhitespaceTokenizer | ||
- name: RegexFeaturizer | ||
- name: LexicalSyntacticFeaturizer | ||
- name: CountVectorsFeaturizer | ||
- name: CountVectorsFeaturizer | ||
analyzer: "char_wb" | ||
min_ngram: 1 | ||
max_ngram: 4 | ||
- name: DIETClassifier | ||
- name: EntitySynonymMapper | ||
- name: DIETSelector |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: ConveRTTokenizer | ||
- name: ConveRTFeaturizer | ||
- name: RegexFeaturizer | ||
- name: LexicalSyntacticFeaturizer | ||
- name: CountVectorsFeaturizer | ||
- name: CountVectorsFeaturizer | ||
analyzer: "char_wb" | ||
min_ngram: 1 | ||
max_ngram: 4 | ||
- name: DIETClassifier | ||
- name: EntitySynonymMapper | ||
- name: DIETSelector |
3 changes: 3 additions & 0 deletions
3
data/configs_for_docs/pretrained_embeddings_convert_config_1.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
language: "en" | ||
|
||
pipeline: "pretrained_embeddings_convert" | ||
tabergma marked this conversation as resolved.
Show resolved
Hide resolved
Ghostvv marked this conversation as resolved.
Show resolved
Hide resolved
|
6 changes: 6 additions & 0 deletions
6
data/configs_for_docs/pretrained_embeddings_convert_config_2.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: "ConveRTTokenizer" | ||
- name: "ConveRTFeaturizer" | ||
- name: "EmbeddingIntentClassifier" | ||
Ghostvv marked this conversation as resolved.
Show resolved
Hide resolved
|
File renamed without changes.
File renamed without changes.
3 changes: 3 additions & 0 deletions
3
data/configs_for_docs/pretrained_embeddings_spacy_config_1.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
language: "en" | ||
|
||
pipeline: "pretrained_embeddings_spacy" | ||
Ghostvv marked this conversation as resolved.
Show resolved
Hide resolved
|
10 changes: 10 additions & 0 deletions
10
data/configs_for_docs/pretrained_embeddings_spacy_config_2.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: "SpacyNLP" | ||
- name: "SpacyTokenizer" | ||
- name: "SpacyFeaturizer" | ||
- name: "RegexFeaturizer" | ||
- name: "CRFEntityExtractor" | ||
- name: "EntitySynonymMapper" | ||
Ghostvv marked this conversation as resolved.
Show resolved
Hide resolved
|
||
- name: "SklearnIntentClassifier" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
language: "en" | ||
|
||
pipeline: "supervised_embeddings" | ||
Ghostvv marked this conversation as resolved.
Show resolved
Hide resolved
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: "WhitespaceTokenizer" | ||
- name: "RegexFeaturizer" | ||
- name: "CRFEntityExtractor" | ||
- name: "EntitySynonymMapper" | ||
- name: "CountVectorsFeaturizer" | ||
- name: "CountVectorsFeaturizer" | ||
analyzer: "char_wb" | ||
min_ngram: 1 | ||
max_ngram: 4 | ||
- name: "EmbeddingIntentClassifier" | ||
Ghostvv marked this conversation as resolved.
Show resolved
Hide resolved
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,11 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: "MitieNLP" | ||
model: "data/total_word_feature_extractor.dat" | ||
- name: "MitieTokenizer" | ||
- name: "MitieEntityExtractor" | ||
- name: "EntitySynonymMapper" | ||
- name: "RegexFeaturizer" | ||
- name: "MitieFeaturizer" | ||
- name: "SklearnIntentClassifier" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: "MitieNLP" | ||
model: "data/total_word_feature_extractor.dat" | ||
- name: "MitieTokenizer" | ||
- name: "MitieEntityExtractor" | ||
- name: "EntitySynonymMapper" | ||
- name: "RegexFeaturizer" | ||
- name: "MitieIntentClassifier" |
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,4 @@ | ||
policies: | ||
- name: EmbeddingPolicy | ||
- name: TEDPolicy | ||
random_seed: 42 | ||
epochs: 2 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
language: "en" | ||
tabergma marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
pipeline: | ||
- name: ConveRTTokenizer | ||
- name: ConveRTFeaturizer | ||
- name: RegexFeaturizer | ||
- name: LexicalSyntacticFeaturizer | ||
- name: CountVectorsFeaturizer | ||
- name: CountVectorsFeaturizer | ||
analyzer: "char_wb" | ||
min_ngram: 1 | ||
max_ngram: 4 | ||
- name: DIETClassifier | ||
- name: EntitySynonymMapper | ||
- name: DIETSelector |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
language: "de" | ||
|
||
pipeline: | ||
- name: SpacyNLP | ||
- name: SpacyTokenizer | ||
- name: SpacyFeaturizer | ||
- name: RegexFeaturizer | ||
- name: LexicalSyntacticFeaturizer | ||
- name: CountVectorsFeaturizer | ||
- name: CountVectorsFeaturizer | ||
analyzer: "char_wb" | ||
min_ngram: 1 | ||
max_ngram: 4 | ||
- name: DIETClassifier | ||
- name: EntitySynonymMapper | ||
- name: DIETSelector |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: SpacyNLP | ||
- name: SpacyTokenizer | ||
- name: SpacyFeaturizer | ||
- name: RegexFeaturizer | ||
- name: LexicalSyntacticFeaturizer | ||
- name: CountVectorsFeaturizer | ||
- name: CountVectorsFeaturizer | ||
analyzer: "char_wb" | ||
min_ngram: 1 | ||
max_ngram: 4 | ||
- name: DIETClassifier | ||
- name: EntitySynonymMapper | ||
- name: DIETSelector |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
language: "en" | ||
|
||
pipeline: | ||
- name: WhitespaceTokenizer | ||
- name: RegexFeaturizer | ||
- name: LexicalSyntacticFeaturizer | ||
- name: CountVectorsFeaturizer | ||
- name: CountVectorsFeaturizer | ||
analyzer: "char_wb" | ||
min_ngram: 1 | ||
max_ngram: 4 | ||
- name: DIETClassifier | ||
- name: EntitySynonymMapper | ||
- name: DIETSelector | ||
- name: DucklingHTTPExtractor | ||
url: "http://duckling:8000" | ||
wochinge marked this conversation as resolved.
Show resolved
Hide resolved
|
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@dakshvar22 what was the library you said we could remove?