Explanations
instances of the word "meaning" and its variations
Negative Logits
ipa       -0.20
uggy      -0.17
eday      -0.16
azzi      -0.16
cano      -0.15
خانه      -0.15
edb       -0.15
rego      -0.15
zon       -0.14
istol     -0.14
Positive Logits
fully     0.38
lessly    0.31
lessness  0.29
FUL       0.27
ful       0.26
ings      0.24
fulness   0.24
less      0.21
full      0.20
iful      0.18
Activations Density 0.028%