INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.06
2:0.08
3:0.08
4:0.11
5:0.07
6:0.07
7:0.08
8:0.08
9:0.07
10:0.08
11:0.09
Negative Logits
Thompson        -1.63
beetles         -1.61
myster          -1.58
Stevenson       -1.54
auder           -1.53
Davidson        -1.52
methodological  -1.51
horm            -1.49
Fra             -1.47
spectator       -1.46
Positive Logits
Assass          1.98
thood           1.87
20439           1.85
itures          1.78
referenced      1.70
=>              1.67
ignt            1.66
zbollah         1.65
translation     1.59
umbnail         1.56
Activations Density 0.000%
This feature has no known activations.