INDEX
Explanations
phrases indicating a lack of knowledge or certainty
instances of the phrase "I had no idea."
New Auto-Interp
Negative Logits
iki
-0.72
Pers
-0.69
inka
-0.67
die
-0.63
rup
-0.62
Tro
-0.62
pu
-0.61
Minor
-0.60
DF
-0.60
visor
-0.60
POSITIVE LOGITS
whatsoever
1.09
why
0.87
how
0.86
ledged
0.80
WHY
0.77
whence
0.77
ariat
0.77
whats
0.74
squat
0.72
what
0.72
Activations Density 0.036%