INDEX
Explanations
expressions of gratitude and support from people
conjunctions and words indicating collective experiences or actions
New Auto-Interp
Negative Logits
Enlarge
-0.67
":"/
-0.66
Prohibition
-0.63
":["
-0.60
null
-0.60
DX
-0.59
rine
-0.58
2020
-0.58
ENSE
-0.58
aq
-0.57
POSITIVE LOGITS
been
1.50
gotten
1.30
been
1.23
gone
1.22
risen
1.07
eaten
1.05
fallen
1.03
begun
1.00
undergone
0.99
done
0.98
Activations Density 0.499%