Explanations
proper nouns related to various entities, particularly the name "Freeman" at different levels of activation
mentions of specific individuals, particularly someone named Freeman
Negative Logits
ificial   -0.82
ģĸ        -0.77
aron      -0.75
alid      -0.74
rolog     -0.70
assic     -0.69
oiler     -0.68
ãĤ°       -0.67
benefit   -0.66
Mehran    -0.64
Positive Logits
Freeman   0.80
ufact     0.73
lings     0.71
vernment  0.70
iquette   0.69
ason      0.68
burg      0.67
Duff      0.67
boro      0.66
Sketch    0.66
Activations Density 0.012%