Yep! Doors have been OPENED 🤯 An open-source cousin of GPT-3 is here 😇
- Performs on par with 6.7B GPT-3
- Performs better and decodes faster than GPT-Neo
- repo + colab + free web demo
Got to know about it through a Towards Data Science article: https://towardsdatascience.com/c...
More details in @arankomatsuzaki's article: https://arankomatsuzaki.wordpres...
@blakehunsicker yes, it can be fine-tuned at a rate of ~5,000 tokens/second, which should be sufficient for small-to-medium-sized datasets. Fine-tuning instructions are here: https://github.com/kingoflolz/me...
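The linked repo covers the official TPU-based mesh-transformer-jax setup. Purely as a rough sketch of what one fine-tuning step looks like conceptually, here's a minimal causal-LM training step using the Hugging Face port of GPT-J — the model id ("EleutherAI/gpt-j-6B"), optimizer, and learning rate are illustrative assumptions, and a 6B model realistically needs a large accelerator or the TPU recipe above:

```python
# Minimal sketch only — not the official mesh-transformer-jax recipe linked above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "EleutherAI/gpt-j-6B"  # assumed Hugging Face port of the released weights
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
model.train()

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)  # illustrative hyperparameters

# One toy step: passing labels=input_ids gives the standard next-token
# (causal LM) objective; the model shifts the labels internally.
batch = tokenizer("Example text from your fine-tuning dataset.", return_tensors="pt")
outputs = model(**batch, labels=batch["input_ids"])
outputs.loss.backward()
optimizer.step()
optimizer.zero_grad()
```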
Hey, hope this is still relevant.
I find GPT-J quite alright at generation, but it gives silly results when it does summaries. Are there any experts here who could help with how I might train it to produce TL;DRs?
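Before any fine-tuning, a common zero-shot trick with GPT-style models is "TL;DR:" prompting — append "TL;DR:" to the passage and let the model continue. A minimal sketch, assuming the Hugging Face port of GPT-J; the checkpoint name and generation settings are assumptions, not a tested recipe:

```python
# Zero-shot "TL;DR:" prompting sketch with the Hugging Face port of GPT-J.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "EleutherAI/gpt-j-6B"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

article = "..."  # the text you want summarized
prompt = article + "\n\nTL;DR:"

inputs = tokenizer(prompt, return_tensors="pt")
output_ids = model.generate(
    **inputs,
    max_new_tokens=60,        # keep the summary short
    do_sample=True,           # sampling settings are illustrative
    temperature=0.7,
    top_p=0.9,
    pad_token_id=tokenizer.eos_token_id,
)

# Decode only the newly generated tokens (generate returns prompt + continuation).
summary = tokenizer.decode(
    output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(summary)
```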
@pallpakk some results were definitely weird but overall, it works great! Negative sentiment, foul language, etc. are context-specific outputs, so if an input is itself negative/abusive, the output is bound to reinforce the same sentiment.
*GPT-J is just as good as GPT-3.* It is more efficient, but with more quirks. In our JPRED scores, it did better with simple TCS tasks, but lost with the more complex tasks.
By removing the Jordan Algorithm: Our next proposed change to a probability model is removing the Jordan Algorithm. The Jordan Algorithm is a special procedure used for simple TCS tasks that allows for fast analysis of different sequence pairs, as well as being able to easily analyze simple n-gram (aka word) models.
...