INDEX
Explanations
phrases or words with special text characters, such as â̏
instances of a specific character or mark (âĢ)
New Auto-Interp
Negative Logits
detached
-0.71
ozy
-0.63
berman
-0.63
lder
-0.62
fragmentation
-0.60
scatter
-0.59
Truman
-0.59
redistribution
-0.58
mosqu
-0.58
transfer
-0.58
POSITIVE LOGITS
âĸº
0.98
¹
0.98
ij
0.93
âĢ
0.92
ł
0.87
IJ
0.82
ª
0.81
âĢ
0.81
âĢł
0.80
COMPLE
0.80
Activations Density 0.205%