INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.05
2:0.08
3:0.08
4:0.09
5:0.08
6:0.07
7:0.08
8:0.08
9:0.09
10:0.08
11:0.08
Negative Logits
ridor
-1.87
ase
-1.69
yden
-1.69
ildo
-1.64
Sat
-1.62
emi
-1.58
idon
-1.55
nell
-1.55
Romney
-1.52
Allah
-1.52
POSITIVE LOGITS
Emblem
1.78
Attributes
1.74
Agric
1.59
ré
1.57
Tact
1.57
Strongh
1.56
Templar
1.53
playbook
1.51
Practices
1.49
Compass
1.48
Activations Density 0.000%
No Known Activations
This feature has no known activations.