INDEX
Explanations
mentions of the name "Kurt"
the name "Burt" or references to individuals with that name
Negative Logits
hare       -0.71
perty      -0.62
derog      -0.61
framing    -0.60
swipe      -0.60
places     -0.59
lion       -0.58
tariffs    -0.57
jails      -0.57
TIME       -0.57
Positive Logits
urt        1.17
iev        1.01
inyl       0.99
leneck     0.88
zman       0.85
rance      0.84
ievers     0.82
osis       0.78
rette      0.78
ritional   0.78
Activations Density 0.003%