INDEX
Explanations
mentions of products or items being evaluated or discussed in detail
New Auto-Interp
Negative Logits
cade
-0.42
earchers
-0.40
wig
-0.39
head
-0.37
ppa
-0.37
Unsure
-0.37
uations
-0.36
uyomi
-0.36
oiler
-0.35
bang
-0.35
POSITIVE LOGITS
ours
0.58
sorts
0.57
theirs
0.49
hers
0.49
everyday
0.45
mine
0.43
humanity
0.41
yours
0.41
virtue
0.40
bip
0.40
Activations Density 12.216%