INDEX
Explanations
questions or inquiry-related phrases
rhetorical questions or phrases emphasizing inquiry
Negative Logits
shore     -0.66
roads     -0.61
trop      -0.60
eer       -0.59
Gy        -0.59
idon      -0.59
println   -0.59
ulic      -0.58
ped       -0.58
gal       -0.57
Positive Logits
soever          1.27
happens         1.12
happened        1.04
transpired      0.97
distinguishes   0.96
happ            0.90
else            0.85
ensued          0.84
constitutes     0.83
separates       0.83
Activations Density 0.086%