INDEX
Explanations
phrases comparing two different elements using the term 'as'
comparisons or similes expressed with the word "as."
New Auto-Interp
Negative Logits
DIT
-0.74
å§«
-0.71
åĪ
-0.67
GES
-0.62
ASE
-0.61
SHIP
-0.59
Firstly
-0.59
Classes
-0.58
decrease
-0.58
ries
-0.58
POSITIVE LOGITS
pired
1.21
bestos
0.99
phalt
0.95
par
0.94
ylum
0.94
phy
0.92
piring
0.91
pir
0.90
pires
0.89
usual
0.87
Activations Density 0.069%