Explanations
occurrences of the word "lot" and related phrases indicating quantity or abundance
Negative Logits
Token       Logit
robe        -0.76
upper       -0.73
iary        -0.70
older       -0.67
tri         -0.66
anamo       -0.65
ammad       -0.64
RESULTS     -0.64
reek        -0.62
omer        -0.62
Positive Logits
Token             Logit
Brus              0.65
EStreamFrame      0.65
orns              0.60
oln               0.59
ipedia            0.59
bies              0.58
(no token shown)  0.57
lez               0.55
agher             0.54
iced              0.54
Activations Density: 0.051%