INDEX
Explanations
mentions of a specific individual named Gil
New Auto-Interp
Negative Logits
ãĥªãĥ³ãĤ°
-0.16
+xml
-0.15
ãĥ¼ãĥĵ
-0.15
doors
-0.15
acht
-0.14
lit
-0.14
ophilia
-0.14
itelist
-0.14
lite
-0.14
вол
-0.14
POSITIVE LOGITS
iland
0.20
more
0.20
christ
0.19
bert
0.19
bson
0.17
crest
0.17
patrick
0.16
Medal
0.16
strap
0.16
git
0.16
Activations Density 0.011%