INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.08
3:0.07
4:0.07
5:0.08
6:0.07
7:0.09
8:0.08
9:0.09
10:0.07
11:0.09
Negative Logits
Zoro
-3.28
Khe
-3.17
Syri
-3.02
Shinra
-2.96
synagogue
-2.91
Jakarta
-2.88
Mana
-2.83
Erit
-2.79
Elf
-2.79
Elf
-2.78
POSITIVE LOGITS
airflow
2.61
litigation
2.59
��
2.50
gate
2.48
Excellence
2.46
advers
2.44
rigorous
2.38
prod
2.38
advanced
2.32
urated
2.31
Activations Density 0.000%
No Known Activations
This feature has no known activations.