INDEX
Explanations
phrases that indicate consecutive occurrences or continuity, emphasizing repetition over time
phrases indicating sequential occurrences or repetitions
New Auto-Interp
Negative Logits
ufact
-0.70
sacrific
-0.69
merce
-0.65
itars
-0.64
streng
-0.62
arus
-0.61
Marginal
-0.60
irs
-0.58
rament
-0.58
abilia
-0.57
POSITIVE LOGITS
Fla
0.80
dies
0.69
..........
0.67
Benz
0.66
agy
0.66
ago
0.65
forth
0.63
Nieto
0.62
adia
0.62
TPS
0.62
Activations Density 0.015%