INDEX
Explanations
statements related to internal processes or operations
terms related to internal and external processes or actions
New Auto-Interp
Negative Logits
Parenthood
-0.85
Gamble
-0.79
Mehran
-0.77
Magikarp
-0.73
EVA
-0.72
Roses
-0.72
Reply
-0.70
ador
-0.68
Guy
-0.67
eer
-0.65
POSITIVE LOGITS
displaced
1.02
internally
1.00
combustion
0.84
identifiable
0.81
speaking
0.80
exting
0.80
housed
0.76
vernment
0.75
dispersed
0.74
combust
0.74
Activations Density 0.006%