INDEX
Explanations
words containing "ind" followed by a high activation value of 9 or 10
occurrences of the substring "ind" in words
New Auto-Interp
Negative Logits
OPLE
-0.79
sonian
-0.74
MQ
-0.73
BuyableInstoreAndOnline
-0.72
veyard
-0.72
ffen
-0.69
zzo
-0.68
Fenrir
-0.67
anwhile
-0.66
cca
-0.66
POSITIVE LOGITS
ented
1.08
etermin
1.04
ivid
1.04
ind
0.98
irection
0.94
ents
0.92
oled
0.90
irect
0.89
ocument
0.87
icol
0.87
Activations Density 0.008%