INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.08
3:0.08
4:0.08
5:0.08
6:0.08
7:0.08
8:0.08
9:0.08
10:0.08
11:0.07
Negative Logits
────
-2.78
Wik
-2.76
OPER
-2.69
Citiz
-2.66
Correction
-2.58
================
-2.46
chuk
-2.46
scrib
-2.43
agre
-2.40
Wem
-2.40
POSITIVE LOGITS
crashes
3.00
Bloom
2.62
peaks
2.55
Katy
2.52
sidel
2.50
plateau
2.47
booming
2.46
garn
2.38
speeding
2.36
Clover
2.36
Activations Density 0.000%
No Known Activations
This feature has no known activations.