INDEX
Explanations
phrases indicating confirmation or support of previous research or findings
confirmation or support of prior information
confirm or support previous findings
New Auto-Interp
Negative Logits
Diſ
-0.81
―――――
-0.81
greateſt
-0.81
Majefty
-0.81
Perſ
-0.79
ſelf
-0.76
ſelves
-0.74
preſent
-0.74
Conſ
-0.73
Jefus
-0.72
POSITIVE LOGITS
previous
0.92
previous
0.82
предыду
0.75
earlier
0.75
précédente
0.74
anteriores
0.71
précédentes
0.68
earlier
0.68
recent
0.68
Previous
0.64
Activations Density 0.851%