INDEX
Explanations
occurrences of the term "individual" in various contexts
New Auto-Interp
Negative Logits
fur
-0.16
furt
-0.16
ibox
-0.16
odian
-0.16
åı£
-0.15
лÑıн
-0.15
igan
-0.15
uros
-0.14
iliar
-0.14
edir
-0.14
POSITIVE LOGITS
individual
0.20
ity
0.19
jednotliv
0.18
ately
0.17
Individual
0.17
swith
0.17
ts
0.16
ized
0.16
arity
0.15
zed
0.15
Activations Density 0.026%