INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.10
1:0.08
2:0.08
3:0.07
4:0.07
5:0.08
6:0.07
7:0.08
8:0.08
9:0.07
10:0.07
11:0.09
Negative Logits
atively
-2.04
ially
-1.95
wic
-1.92
kar
-1.86
hots
-1.84
nown
-1.82
tro
-1.82
NetMessage
-1.80
ocated
-1.80
dat
-1.79
POSITIVE LOGITS
buds
1.65
Ved
1.64
Birth
1.64
Nirvana
1.56
tongues
1.54
bandwagon
1.53
stature
1.53
Metatron
1.52
clarity
1.49
Cups
1.49
Activations Density 0.000%
No Known Activations
This feature has no known activations.