INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.11
2:0.08
3:0.07
4:0.08
5:0.10
6:0.05
7:0.06
8:0.09
9:0.09
10:0.08
11:0.08
Negative Logits
dunno
-1.87
venture
-1.69
disemb
-1.60
correspondence
-1.54
ventures
-1.52
speculation
-1.52
speculative
-1.52
irresponsible
-1.49
intellectual
-1.48
exper
-1.47
POSITIVE LOGITS
yip
2.08
Ao
2.01
myra
1.75
Adin
1.73
onomy
1.64
[|
1.63
ゼウス
1.63
Lanka
1.62
Wan
1.58
ーティ
1.58
Activations Density 0.000%
No Known Activations
This feature has no known activations.