INDEX
Explanations
guests or guests' names in a talk show or interview setting
Negative Logits
cling        -0.80
croft        -0.73
ŃĶ           -0.71
baugh        -0.70
spect        -0.68
rings        -0.67
rums         -0.65
hower        -0.65
pillar       -0.63
womb         -0.63
Positive Logits
vernment     1.24
errilla      1.12
inea         1.11
itars        1.08
bernatorial  1.07
essing       1.03
arant        1.02
cci          1.00
zman         0.98
ilt          0.98
Activations Density 0.558%