INDEX
Explanations
words ending in "person" or "people" in various contexts
references to individuals or personas in various contexts
Negative Logits
Triangle    -0.71
é¾          -0.68
DERR        -0.67
Monaco      -0.66
æĸ¹         -0.62
Bav         -0.61
Franken     -0.61
Tycoon      -0.61
minus       -0.61
Rated       -0.61
Positive Logits
pect        1.11
hip         1.01
pers        0.99
istence     0.98
erver       0.95
istent      0.85
cient       0.83
afety       0.82
ervative    0.82
icker       0.80
Activations Density 0.004%
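
For scale, a density of 0.004% corresponds to roughly 4 active tokens per 100,000. The sketch below shows one way such a figure could be computed. It assumes density means the share of tokens on which the feature's activation exceeds a threshold; that definition, the function name `activation_density`, the zero threshold, and the sample data are all illustrative assumptions rather than details taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the fraction of tokens on which the feature
    is active, expressed as a percentage.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 4 active tokens out of 100,000.
sample = [0.0] * 99_996 + [1.3, 0.7, 2.1, 0.4]
density = activation_density(sample)
print(f"{density:.3f}%")
```

A feature this sparse fires on very few tokens, so any estimate of its density needs a large token sample to be stable.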