INDEX
Explanations
phrases related to actions and consequences
statements related to causes and consequences
New Auto-Interp
Negative Logits
badass
-0.92
!'"
-0.92
goddamn
-0.91
.'"
-0.91
,'"
-0.85
yours
-0.85
!'
-0.84
?'"
-0.83
ain
-0.81
marvelous
-0.79
POSITIVE LOGITS
apologised
1.07
licences
1.06
emphas
1.04
analys
1.01
recognised
1.00
realised
0.99
organisers
0.98
counselling
0.98
organis
0.96
organisations
0.95
Activations Density 0.968%