Explanations
adjectives related to the quality of something
past tense verbs indicating actions or events
Negative Logits
emption        -0.60
IE             -0.55
&              -0.53
ativity        -0.52
neighboring    -0.51
.*             -0.51
Warrant        -0.50
*/             -0.50
pree           -0.49
ILCS           -0.49
Positive Logits
consisted      0.77
divided        0.69
varied         0.67
mainly         0.66
lasted         0.63
gradually      0.63
ktop           0.63
apologised     0.63
consist        0.61
travelled      0.61
Activations Density 0.378%