INDEX
Explanations
phrases related to specific situations or events
instances of the word "when."
New Auto-Interp
Negative Logits
agin
-0.80
thal
-0.66
Bas
-0.63
ictive
-0.63
yan
-0.63
Es
-0.62
ha
-0.62
aches
-0.62
gan
-0.61
bas
-0.61
POSITIVE LOGITS
soever
1.23
asked
0.82
confronted
0.81
irlf
0.79
pressed
0.78
they
0.76
contacted
0.72
*/(
0.71
she
0.71
faced
0.70
Activations Density 0.124%