Explanations
words or phrases that are used frequently or repeatedly in a given context
repetitive phrases emphasizing frequency of occurrence
Negative Logits
Token      Logit
utenberg   -0.84
kay        -0.77
agate      -0.76
gur        -0.72
gae        -0.70
shi        -0.69
plates     -0.68
ged        -0.66
zbek       -0.65
imming     -0.65
Positive Logits
Token          Logit
entimes        1.38
overlooked     0.97
encountered    0.91
resorted       0.91
referred       0.84
misunderstood  0.83
cited          0.83
theless        0.82
relied         0.81
touted         0.81
Activations Density: 0.034%