INDEX
Explanations
names of people and their actions or roles
Negative Logits
enis     -0.17
ÏĦιÏĥ     -0.16
RAY      -0.16
Petty    -0.16
aris     -0.15
ooks     -0.14
Lug      -0.14
en       -0.14
pie      -0.14
νια      -0.13
Positive Logits
peg      0.16
æ°ı      0.15
.slim    0.15
bara     0.15
Scan     0.15
asions   0.15
HCI      0.14
SCAN     0.14
.Xaml    0.14
ãĤ§      0.14
Activations Density 0.190%