Explanations
comparisons or explanations of a specific topic or situation
phrases that use "in that" to introduce explanations or justifications
Negative Logits
Token        Logit
asters       -0.80
ormons       -0.76
rior         -0.71
Guard        -0.71
inx          -0.64
srfAttach    -0.63
izont        -0.62
IDES         -0.62
quit         -0.61
ono          -0.61
Positive Logits
Token          Logit
vein           1.12
regard         1.02
particular     0.99
vicinity       0.97
same           0.92
timeframe      0.84
manner         0.83
circumstance   0.81
area           0.80
fateful        0.79
Activations Density 0.045%