INDEX
Explanations
references to a person named Carl with a high level of activation, potentially indicating a focus on identifying instances related to this person
mentions of the name "Carl."
New Auto-Interp
Negative Logits
lda
-0.71
HER
-0.69
PDATE
-0.69
nomine
-0.69
scant
-0.65
FY
-0.64
Blueprint
-0.63
Regulatory
-0.62
disposed
-0.61
Yoga
-0.61
POSITIVE LOGITS
isle
1.58
obal
1.01
izabeth
0.98
ota
0.96
otta
0.95
osaurus
0.94
inson
0.91
azar
0.91
XVI
0.89
onia
0.87
Activations Density 0.010%