INDEX
Explanations
phrases related to acceptance and tolerance towards various concepts
New Auto-Interp
Negative Logits
mingen
-0.69
virons
-0.67
Autowired
-0.64
avrebbero
-0.64
Barker
-0.63
toolbox
-0.63
devriez
-0.63
supra
-0.62
Lub
-0.62
Иль
-0.62
POSITIVE LOGITS
accept
2.42
Accept
2.27
accepts
2.25
acceptance
2.23
accepting
2.17
accepted
2.16
ACCEPT
2.13
Accepting
2.12
accept
2.08
Acceptance
2.08
Activations Density 0.071%