INDEX
Explanations
the word "Things" or similar variations
mentions of the concept of "things," particularly in relation to changes or situations
New Auto-Interp
Negative Logits
iary
-0.81
gee
-0.74
pless
-0.74
irst
-0.69
onym
-0.69
asio
-0.66
adia
-0.66
bern
-0.65
ILE
-0.64
NES
-0.63
POSITIVE LOGITS
happened
1.09
transpired
1.08
happen
0.92
happening
0.89
Happ
0.88
happens
0.86
escalated
0.85
happ
0.81
bably
0.80
unfolded
0.79
Activations Density 0.039%