INDEX
Explanations
words related to skiing
mentions of skiing or related activities
Negative Logits
Conf        -0.68
privacy     -0.67
Privacy     -0.66
electr      -0.66
Ind         -0.63
act         -0.62
public      -0.61
loud        -0.59
label       -0.59
confront    -0.59
Positive Logits
ski         4.70
sky         2.21
sk          1.73
icz         1.51
skip        1.34
ewski       1.26
SK          1.24
anski       1.15
arov        1.14
ki          1.12
Activations Density 0.013%