INDEX
Explanations
specific propositions or statements
the word "that" in various contexts
New Auto-Interp
Negative Logits
izont
-0.66
Mane
-0.65
Jaguar
-0.64
Champ
-0.60
MX
-0.59
aukee
-0.59
Mi
-0.58
Tax
-0.58
throp
-0.58
IAS
-0.57
POSITIVE LOGITS
culminated
0.86
consequently
0.78
resulted
0.77
cher
0.74
fateful
0.72
secondly
0.71
contradicts
0.70
eatures
0.69
furthermore
0.69
reconc
0.69
Activations Density 0.109%