INDEX
Explanations
words related to melting, dissolution, or disintegration
references to the concept of melting
New Auto-Interp
Negative Logits
Assass
-0.66
Guard
-0.62
ataka
-0.62
ours
-0.61
CHO
-0.60
udence
-0.58
Vanguard
-0.58
eur
-0.57
Belg
-0.56
vernment
-0.56
POSITIVE LOGITS
downs
1.08
butter
0.98
melted
0.89
glaciers
0.85
melt
0.83
ice
0.82
melting
0.81
down
0.78
ember
0.78
snow
0.78
Activations Density 0.034%