Explanations
mentions of frequency or repetition
frequent use of the word "often" or references to repetitive occurrences
Negative Logits
utenberg     -0.92
abad         -0.79
plates       -0.78
agate        -0.71
ENE          -0.70
gae          -0.70
oops         -0.70
jriwal       -0.69
eki          -0.68
Integrity    -0.67
Positive Logits
entimes          1.46
overlooked       1.11
times            0.92
misunderstood    0.91
referred         0.91
theless          0.89
cited            0.89
mistaken         0.89
resorted         0.87
times            0.87
Activations Density 0.044%