INDEX
Explanations
the word "case" with a high level of activation
repetitive phrases indicating a sense of assertion or claim
Negative Logits
urses       -0.78
todd        -0.77
inders      -0.76
ĪĴ          -0.76
IRE         -0.72
ursday      -0.72
arding      -0.72
ard         -0.72
ardo        -0.70
dropping    -0.68
Positive Logits
anymore     0.74
ioned       0.72
scenario    0.71
pheus       0.71
whatsoever  0.71
liest       0.69
,,,,        0.69
Isles       0.67
lla         0.67
pring       0.65
Activations Density: 0.025%