INDEX
Explanations
adjectives or nouns related to harmful or risky situations
terms that indicate a sense of danger or harmfulness
New Auto-Interp
Negative Logits
arest
-0.81
Ħ¢
-0.78
olitan
-0.77
orthy
-0.75
edia
-0.75
ļéĨĴ
-0.75
ī
-0.75
apolis
-0.73
elle
-0.72
gdala
-0.72
POSITIVE LOGITS
combination
0.91
combinations
0.78
stunts
0.78
dangerous
0.78
endanger
0.77
snakes
0.77
situations
0.77
slope
0.76
adolesc
0.74
ening
0.74
Activations Density 0.056%