INDEX
Explanations
references to external or outside factors influencing a situation
Negative Logits
Token          Logit
ruk            -0.17
hole           -0.17
ervo           -0.16
.createClass   -0.16
886            -0.16
creen          -0.15
тен            -0.15
ead            -0.15
esen           -0.15
outsider       -0.15
Positive Logits
Token          Logit
/internal      0.45
/Internal      0.37
-facing        0.25
ities          0.23
influences     0.22
factors        0.19
parties        0.19
wear           0.18
uber           0.18
/in            0.17
Activations Density 0.028%
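
The density figure above is most naturally read as the share of sampled token positions on which the feature fires. The page does not say how it was computed, so the following is only a minimal sketch under that assumption; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" means the fraction of token positions where the
    feature is active. The source page does not define the metric.
    """
    values = list(activations)
    if not values:
        return 0.0
    active = sum(1 for a in values if a > threshold)
    return 100.0 * active / len(values)


# Hypothetical data: 10,000 token positions, 3 of which activate the feature.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
print(f"{activation_density(sample):.3f}%")  # prints "0.030%"
```

Under this reading, a density of 0.028% would correspond to roughly 28 active positions per 100,000 tokens sampled.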