Explanations
😊 greetings and positive affirmations
Negative Logits
the         0.68
,           0.64
an          0.61
:           0.61
a           0.59
fails       0.59
creates     0.58
destroys    0.57
objects     0.54
uses        0.51
Positive Logits
আপনার    0.78
તમારા    0.76
Congratulations    0.75
Welcome    0.74
Congratulations    0.74
Você    0.74
you    0.72
máte    0.72
Thank    0.71
আপনি    0.71
Activations Density 0.018%