INDEX
Explanations
phrases or sentences expressing the concept of "not always"
phrases indicating consistency or ongoing situations
Negative Logits
IDA                  -0.80
bern                 -0.77
pour                 -0.72
externalActionCode   -0.72
zai                  -0.72
bath                 -0.72
hole                 -0.70
Wad                  -0.70
sonian               -0.70
workshop             -0.69
Positive Logits
bothered             0.87
necessarily          0.86
appre                0.84
entimes              0.79
appreciated          0.78
forg                 0.77
appe                 0.75
portrayed            0.75
mistaken             0.74
theless              0.73
Activations Density 0.017%