INDEX
Explanations
words related to spatial or opposing relationships
references to different sides or perspectives in a debate or conflict
New Auto-Interp
Negative Logits
endi
-0.84
ESCO
-0.80
yrinth
-0.76
erion
-0.73
untu
-0.72
itative
-0.71
Ĥİ
-0.68
yssey
-0.65
acan
-0.64
ogy
-0.64
POSITIVE LOGITS
kick
1.06
thereof
0.89
board
0.81
burn
0.80
side
0.77
boards
0.77
of
0.76
bars
0.71
Za
0.70
real
0.68
Activations Density 0.034%