INDEX
Explanations
references to comparisons or similarities
New Auto-Interp
Negative Logits
houſe
-1.15
raiſ
-1.10
Houſe
-1.05
Theſe
-1.03
ſeveral
-1.01
itſelf
-1.01
myſelf
-1.01
ſet
-1.00
againſt
-0.98
pleaſure
-0.96
POSITIVE LOGITS
ษัท
0.62
’
0.58
Co
0.57
cupertino
0.56
0.55
ไ
0.55
’
0.52
0.51
•
0.50
ArgsConstructor
0.50
Activations Density 0.140%