INDEX
Explanations
specific instances or events
the repeated phrase "there was" in various contexts
Negative Logits
arta      -0.83
owe       -0.73
mare      -0.72
lish      -0.65
coin      -0.62
otic      -0.62
anium     -0.61
®         -0.60
allows    -0.60
Nation    -0.59
Positive Logits
pandemonium    0.91
uproar         0.83
emonium        0.80
murm           0.77
OOL            0.73
outcry         0.72
reluct         0.71
hes            0.70
plenty         0.67
whispers       0.65
Activations Density 0.089%