INDEX
Explanations
words related to critical importance or significance
references to critical importance
New Auto-Interp
Negative Logits
ander
-0.85
nery
-0.80
bows
-0.73
amm
-0.72
bow
-0.71
ilk
-0.70
á
-0.68
lance
-0.68
©¶æ
-0.68
uddin
-0.68
POSITIVE LOGITS
importance
0.87
wcs
0.83
crucial
0.79
ingredient
0.79
swing
0.78
components
0.77
ingred
0.76
onite
0.73
guiActiveUn
0.73
vital
0.71
Activations Density 0.014%