-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
generators from corpora #30
Comments
Oh I really like the idea of having the dotted generators! I'm wondering how to source them though. We'll need to do some little research about a dictionary. Then we can embed it in the binary with go-bindata, I used it often and it's nice. Would you like to give it a try @jorinvo ? If so I assign the issue to you. No strings attached though! I'll have some time later this week/next week and I planned to add more generators |
This looks interesting: https://github.com/dariusk/corpora There are many other useful categories. Some examples: Since the list of generators will probably grow no matter how many we decide to add now, If we figure this out, I can give it a try. |
I think I'd split the problem in parts (the usual "divide et impera"):
Possibly I'd do the first two steps in a PR and just open issues/adding new generators on the go. I like the idea of adding as many generators as possible given people use them. Otherwise it feels like "over-engineering" (maybe not a great term though). I like corpora:
I'd be fine following the lead of corpora about how to organize the naming of the generators and go for something like: $ fakedata dict.animals # generates random animal
$
$ fakedata dcit.animals.cats # generates random cat breed Which would be funny to implement too (nice side-effect :)). I'm a bit unsure about the top name
none of them makes me happy though as they don't seem very fitting so, as always, feedback is more than welcome! Does this organization works? I feel like it's "good enough" to get us started |
Can i ask for loadable dictionaries, so there is a small possibility to use other dictionaries that english? |
@gnanet sure you can! But I believe it makes sense to discuss it in a separate issue, I think the feature may have a similar behavior but it needs a different user interface. |
dariusk/corpora seems like good place to start with.
As you pointed out, the organization in that repo looks pretty good already.
I think we should add only the once that appear useful to us.
If we import the code manually we can stick to the existing data. We could still extend it. I would separate the current What do you think @lucapette ? |
I know go-bindata isn't maintained anymore but, to be fair, I've been using it a lot and had no real issues with it. But your suggestion of just keeping everything as go code is very appealing to me, the project remains go gettable and that's pretty nice. So I'd say we proceed as you say. I buy your point of skipping the parent namespace. And thank you very much for the suggestion, sometimes the simplest solution is hard to see! I would still suggest we automate the process of importing data from corpora, we could add a I love the idea of getting a About adding new generators, I agree we shouldn't import everything so I say we decide what's worth importing upfront in the context of this issue. To wrap up,
@jorinvo what do you think? |
Sounds perfect! |
dictionary
could return a random word from an English dictionary (or a top 1000 words list).Along with this there could be
dictionary.noun
,dictionary.verb
,dictionary.adjective
.The text was updated successfully, but these errors were encountered: