INDEX
Explanations
phrases related to reasons or causes
phrases that denote reasons or causes behind events or actions
Negative Logits
Mario        -0.71
AIN          -0.68
ena          -0.65
Italy        -0.65
forestation  -0.64
enegger      -0.64
severe       -0.64
isha         -0.64
fox          -0.63
173          -0.63
Positive Logits
differences      0.89
discrepancies    0.84
difference       0.84
workings         0.82
motivations      0.79
tendencies       0.79
priorities       0.77
similarities     0.77
characteristics  0.76
preferences      0.75
Activations Density: 1.126%