INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
grass
-0.76
iba
-0.76
hof
-0.71
ĸļ
-0.70
rye
-0.68
THC
-0.67
season
-0.66
imaru
-0.66
oven
-0.65
winner
-0.65
POSITIVE LOGITS
Cancel
0.85
property
0.70
Strikes
0.65
yrights
0.65
ITED
0.62
rhetorical
0.61
expiration
0.60
Cance
0.60
istg
0.60
ambiguous
0.59
Activations Density 0.000%
No Known Activations
This feature has no known activations.