INDEX
Explanations
mentions of a specific person named Caitlyn
New Auto-Interp
Negative Logits
pinch
-0.71
ures
-0.68
ured
-0.68
ELD
-0.66
drm
-0.62
ItemTracker
-0.61
heimer
-0.61
defic
-0.61
Provided
-0.60
unification
-0.58
POSITIVE LOGITS
lyn
1.30
lin
1.20
Sith
0.94
lus
0.91
lan
0.88
ling
0.87
lynn
0.87
leton
0.83
riers
0.82
LIN
0.81
Activations Density 0.006%