INDEX
Explanations
phrases related to providing guidance, instructions, or encouragement to others
New Auto-Interp
Negative Logits
Firstly
-0.71
nutshell
-0.69
anny
-0.66
velop
-0.62
entin
-0.61
Frie
-0.61
Owl
-0.61
whichever
-0.58
Guarant
-0.58
Âł Âł Âł Âł Âł Âł Âł Âł
-0.58
POSITIVE LOGITS
similarly
1.50
similar
1.35
likewise
1.18
equally
1.13
same
0.93
similar
0.91
same
0.90
Similar
0.89
emulate
0.87
comparable
0.85
Activations Density 0.575%