INDEX
Explanations
proper nouns or names, specifically "Keller"
mentions of specific names and organizations
New Auto-Interp
Negative Logits
ential
-0.90
Ö¼
-0.79
entials
-0.79
ENCE
-0.78
encia
-0.76
encies
-0.76
ences
-0.76
aud
-0.75
IBLE
-0.74
encers
-0.72
POSITIVE LOGITS
DPR
0.86
istani
0.81
Kat
0.79
kat
0.75
ratom
0.75
Kob
0.75
patrick
0.74
Silk
0.73
wagen
0.73
Helsinki
0.73
Activations Density 0.035%