INDEX
Explanations
natural disaster-related terms
New Auto-Interp
Negative Logits
iture
-0.79
Token
-0.74
Founding
-0.74
Ethics
-0.74
Employee
-0.72
JECT
-0.72
Lives
-0.71
Priv
-0.68
Intellectual
-0.67
prison
-0.66
POSITIVE LOGITS
gust
1.16
winds
1.12
blowing
1.05
blew
0.98
swept
0.96
Irma
0.96
raged
0.91
roared
0.89
roar
0.89
roaring
0.88
Activations Density 0.079%