Explanations
activations in text sections that present technical or official information, possibly related to procedures or authorizations
occurrences of significant numerical values and dates
Negative Logits
adolesc       -0.81
xual          -0.76
awaru         -0.72
estranged     -0.69
Gardens       -0.66
mur           -0.66
greenhouse    -0.62
Vik           -0.62
teen          -0.61
stomp         -0.61
Positive Logits
However       1.03
Traditional   1.00
Unlike        0.99
Furthermore   0.97
Fortunately   0.96
Statistics    0.95
Since         0.95
Generally     0.94
During        0.93
Luckily       0.92
Activations Density 0.330%
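
The density figure above is most naturally read as the share of token positions on which the feature fires. The sketch below illustrates that reading only. It assumes "fires" means an activation strictly greater than zero, and the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical rather than taken from the tool that produced this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a feature counts as active at a position when its activation
    is strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 1000 token positions, 3 of which activate the feature.
sample = [0.0] * 997 + [0.4, 1.2, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.300%
```

Under this reading, a density of 0.330% would correspond to the feature being active on roughly 33 of every 10,000 tokens in the sampled text.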