Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.07
2:0.08
3:0.08
4:0.09
5:0.09
6:0.09
7:0.08
8:0.07
9:0.07
10:0.07
11:0.09
Negative Logits
anooga      -1.84
swer        -1.65
kson        -1.64
trl         -1.59
laun        -1.55
enlarge     -1.54
verning     -1.53
ippi        -1.51
constitu    -1.50
ga          -1.49
Positive Logits
Fram        1.85
Scene       1.72
Law         1.67
DOC         1.66
Rare        1.54
Certain     1.54
Attorney    1.53
market      1.53
VID         1.52
common      1.52
Activations Density 0.000%
No Known Activations
This feature has no known activations.