INDEX
Explanations
references to old or outdated things within a context
New Auto-Interp
Negative Logits
utterstock
-0.93
ï¸ı
-0.85
ILCS
-0.83
ommod
-0.82
atography
-0.80
amera
-0.80
illation
-0.79
gur
-0.78
acca
-0.73
pard
-0.73
POSITIVE LOGITS
fashioned
1.41
timers
1.01
standby
0.97
est
0.93
fashioned
0.84
stomp
0.82
school
0.79
school
0.79
bies
0.78
guard
0.76
Activations Density 0.023%