INDEX
Explanations
references to specific individuals, particularly in a corporate context
New Auto-Interp
Negative Logits
abis
-0.17
enty
-0.16
cribe
-0.16
onne
-0.16
lish
-0.16
combe
-0.15
apore
-0.15
unge
-0.15
ukes
-0.15
anship
-0.14
POSITIVE LOGITS
Cro
0.23
Gro
0.22
Cro
0.22
cro
0.20
gro
0.20
gro
0.20
Gro
0.19
tro
0.18
Tro
0.17
tro
0.16
Activations Density 0.054%