INDEX
Explanations
mentions of proper nouns or entities
occurrences of the abbreviation "AK" related to Alaska, frequently appearing in various contexts
Negative Logits
weights        -0.75
Weasley        -0.72
istg           -0.69
guiIcon        -0.67
ãĤ£            -0.67
drops          -0.66
weight         -0.65
boosters       -0.60
guiActiveUn    -0.60
ãĤ¡            -0.59
Positive Logits
AK       0.93
arak     0.89
AY       0.88
NESS     0.81
ansas    0.78
PLIC     0.78
ay       0.77
umar     0.77
ernel    0.77
AIN      0.76
Activations Density 0.003%
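
The negative-logit entries ãĤ£ and ãĤ¡ look like byte-level BPE token strings rather than encoding damage. The sketch below shows one way to turn such strings back into readable text. It assumes the tokens come from a GPT-2-style byte-level vocabulary, which the source does not state. The helper names `byte_to_char_table` and `decode_token` are hypothetical, and the table is rebuilt here in plain Python instead of being taken from any tokenizer library.

```python
def byte_to_char_table():
    """Rebuild the GPT-2-style mapping from byte values to printable characters.

    Assumption: the dashboard's vocabulary uses this byte-level scheme.
    """
    # Bytes that are already printable map to the character with the same code point.
    printable = (
        list(range(0x21, 0x7F))      # '!' .. '~'
        + list(range(0xA1, 0xAD))    # U+00A1 .. U+00AC
        + list(range(0xAE, 0x100))   # U+00AE .. U+00FF
    )
    table = {b: chr(b) for b in printable}
    # Every remaining byte is assigned the next free code point from 256 upward.
    offset = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + offset)
            offset += 1
    return table


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in byte_to_char_table().items()}


def decode_token(token: str) -> str:
    """Map a displayed token string back to bytes, then decode those bytes as UTF-8."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    # The two odd-looking entries from the negative-logit list, written with
    # escapes so the example does not depend on the file's own encoding.
    for shown in ("\u00e3\u0124\u00a3", "\u00e3\u0124\u00a1", "AK"):
        print(repr(shown), "->", repr(decode_token(shown)))
```

Under that assumption the two entries decode to the small katakana characters ィ and ァ, while plain ASCII tokens such as AK pass through unchanged.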