INDEX
Explanations
phrases indicating a rephrasing or further explanation of content
phrases that introduce paraphrased or rephrased statements
Negative Logits
adium          -0.68
ousel          -0.66
heaviest       -0.66
Columb         -0.65
Lima           -0.64
sund           -0.61
lecturer       -0.60
Clintons       -0.59
intermediary   -0.59
cent           -0.58
Positive Logits
forth          0.92
hots           0.83
guiActiveUn    0.77
phony          0.77
atility        0.77
heses          0.77
mith           0.76
tainment       0.74
ername         0.74
oire           0.73
Activations Density 0.027%