INDEX
Explanations
instances of the word "lur" and its variations
New Auto-Interp
Negative Logits
Errors
-0.71
enegger
-0.70
Fathers
-0.70
guiActiveUnfocused
-0.66
ortium
-0.66
Merchants
-0.65
Pwr
-0.65
Breaker
-0.64
Luther
-0.63
ELS
-0.63
POSITIVE LOGITS
uten
0.94
ker
0.88
ping
0.87
iously
0.86
aces
0.82
ormal
0.82
ace
0.80
kers
0.79
ge
0.79
ous
0.78
Activations Density 0.008%