INDEX
Explanations
instances where a reference to a particular event or place is mentioned in relation to a topic
instances of the word "this."
New Auto-Interp
Negative Logits
letes
-0.74
acers
-0.73
trump
-0.70
ickets
-0.70
rac
-0.69
arthed
-0.67
mist
-0.67
onis
-0.66
Izan
-0.66
winning
-0.66
POSITIVE LOGITS
regard
1.17
vein
1.06
context
1.03
case
0.98
particular
0.97
circumstance
0.93
tutorial
0.92
manner
0.90
week
0.89
instance
0.88
Activations Density 0.051%