INDEX
Explanations
phrases related to specific actions or events involving people and locations
New Auto-Interp
Negative Logits
ummies
-0.16
Nath
-0.15
division
-0.14
oretical
-0.14
iction
-0.14
anie
-0.14
ishops
-0.14
hani
-0.14
semblies
-0.14
ondo
-0.14
POSITIVE LOGITS
äºĨä¸Ģ
0.22
ifies
0.20
ablish
0.20
ÏĦαι
0.20
ulates
0.19
ibrate
0.19
好äºĨ
0.18
äºĨ
0.18
lessly
0.18
inize
0.17
Activations Density 0.118%