INDEX
Explanations
phrases related to significant events or developments
phrases that indicate a list or items being presented
New Auto-Interp
Negative Logits
nels
-0.73
iffe
-0.72
rx
-0.68
ede
-0.68
each
-0.67
arry
-0.66
relies
-0.66
Measure
-0.65
emphasizes
-0.64
gyn
-0.64
POSITIVE LOGITS
obligatory
1.25
beginnings
1.24
usual
1.16
slightest
1.13
culmination
1.12
dreaded
1.10
finest
1.09
definitive
1.07
simplest
1.07
smallest
1.06
Activations Density 0.514%