INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.06
1:0.11
2:0.10
3:0.08
4:0.08
5:0.07
6:0.08
7:0.08
8:0.07
9:0.08
10:0.07
11:0.07
Negative Logits
Irwin
-1.66
alls
-1.60
Nug
-1.54
Accessed
-1.54
Interior
-1.53
Chim
-1.53
Rober
-1.48
lishing
-1.46
Kemp
-1.46
inherit
-1.45
POSITIVE LOGITS
learners
1.74
isite
1.68
heid
1.65
veter
1.65
competence
1.61
halla
1.59
hemisphere
1.58
geries
1.57
lessons
1.56
behavi
1.55
Activations Density 0.000%
No Known Activations
This feature has no known activations.