Another banger post Shivani! One question: since most models predict one token at a time, does that mean that after each token is generated, the model repeats the entire process for the next token?
Yes Luq, at inference time the model first generates a token, appends it to the context, and then runs inference again over the updated context
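That generate-append-repeat loop can be sketched like this (a toy `next_token_fn` stands in for the model's forward pass; all names here are illustrative, not any real library's API):

```python
def generate(prompt_tokens, next_token_fn, max_new_tokens=10, eos="<eos>"):
    """Minimal autoregressive decoding loop (illustrative sketch)."""
    context = list(prompt_tokens)        # the growing context window
    for _ in range(max_new_tokens):
        token = next_token_fn(context)   # one inference pass over the full context
        context.append(token)            # append the new token, then repeat
        if token == eos:                 # stop once the model emits end-of-sequence
            break
    return context

# Toy stand-in for the model: just emits a canned continuation.
canned = iter(["The", "answer", "is", "42", "<eos>"])
result = generate(["Q:", "what", "is", "6*7?"], lambda ctx: next(canned))
print(result)
```

The key point the loop makes explicit: every generated token becomes part of the input for the very next inference step, which is why tokens written into the context (including visible reasoning) shape everything generated after them.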
So, that means a prompt like “show reasoning step by step, then give the answer” is better than “think step by step, but show only the answer”. Since the reasoning context is now ‘fixed’ during each new token update, there’s less room for the model to go down a different reasoning path at each new inference step. Is this a correct understanding? Also, thanks for entertaining my questions ☺️
That’s right Luq! Unexpressed “thinking” means nothing; models “think” in tokens, and only if those tokens are in the context will the model pay attention to them! Even in reasoning models, that’s exactly what’s happening (the thinking might be hidden from us in the UI, but the model’s context is enriched with it)!
Happy to answer all questions 😊
Thank youu, that helps a lot! You’ve gained a new fan here 😝 Looking forward to your next post!
Loved the breakdown @Shivani Virdi. Thanks for sharing!
So glad to hear that, Rohit!