--- a +++ b/README.md @@ -0,0 +1,154 @@ +<div class="sc-cmRAlD dkqmWS"><div class="sc-UEtKG dGqiYy sc-flttKd cguEtd"><div class="sc-fqwslf gsqkEc"><div class="sc-cBQMlg kAHhUk"><h2 class="sc-dcKlJK sc-cVttbi gqEuPW ksnHgj">About Dataset</h2></div></div></div><div class="sc-jgvlka jFuPjz"><div class="sc-gzqKSP tNtjD"><div style="min-height: 80px;"><div class="sc-etVRix jqYJaa sc-bMmLMY ZURWJ"><pre class="uc-code-block"><code>Data on recurrences of bladder cancer, used by many people to demonstrate methodology for recurrent event modelling. +</code></pre> +<table> +<thead> +<tr> +<th>Column</th> +<th>Description</th> +<th>Format</th> +</tr> +</thead> +<tbody> +<tr> +<td><strong>Bladder Dataset 1</strong></td> +<td></td> +<td></td> +</tr> +<tr> +<td>id</td> +<td>Patient ID</td> +<td></td> +</tr> +<tr> +<td>treatment</td> +<td>Treatment received</td> +<td>Placebo, pyridoxine (vitamin B6), or thiotepa</td> +</tr> +<tr> +<td>number</td> +<td>Initial number of tumors</td> +<td>8=8 or more</td> +</tr> +<tr> +<td>size</td> +<td>Size (cm) of largest initial tumor</td> +<td></td> +</tr> +<tr> +<td>recur</td> +<td>Number of recurrences</td> +<td></td> +</tr> +<tr> +<td>start</td> +<td>Start time of each interval</td> +<td></td> +</tr> +<tr> +<td>stop</td> +<td>End time of each interval</td> +<td></td> +</tr> +<tr> +<td>status</td> +<td>End of interval code</td> +<td>0=censored, 1=recurrence, 2=death from bladder disease, 3=death other/unknown cause</td> +</tr> +<tr> +<td>rtumor</td> +<td>Number of tumors found at recurrence</td> +<td></td> +</tr> +<tr> +<td>rsize</td> +<td>Size of largest tumor at recurrence</td> +<td></td> +</tr> +<tr> +<td>enum</td> +<td>Event number (observation number 
within patient)</td> +<td></td> +</tr> +<tr> +<td><strong>Bladder Dataset 0</strong></td> +<td></td> +<td></td> +</tr> +<tr> +<td>id</td> +<td>Patient ID</td> +<td></td> +</tr> +<tr> +<td>rx</td> +<td>Treatment received</td> +<td>1=placebo, 2=thiotepa</td> +</tr> +<tr> +<td>number</td> +<td>Initial number of tumors</td> +<td>8=8 or more</td> +</tr> +<tr> +<td>size</td> +<td>Size (cm) of largest initial tumor</td> +<td></td> +</tr> +<tr> +<td>stop</td> +<td>Recurrence or censoring time</td> +<td></td> +</tr> +<tr> +<td>enum</td> +<td>Which recurrence (up to 4)</td> +<td></td> +</tr> +<tr> +<td><strong>Bladder Dataset 2</strong></td> +<td></td> +<td></td> +</tr> +<tr> +<td>id</td> +<td>Patient ID</td> +<td></td> +</tr> +<tr> +<td>rx</td> +<td>Treatment received</td> +<td>1=placebo, 2=thiotepa</td> +</tr> +<tr> +<td>number</td> +<td>Initial number of tumors</td> +<td>8=8 or more</td> +</tr> +<tr> +<td>size</td> +<td>Size (cm) of largest initial tumor</td> +<td></td> +</tr> +<tr> +<td>start</td> +<td>Start of interval (0 or previous recurrence time)</td> +<td></td> +</tr> +<tr> +<td>stop</td> +<td>Recurrence or censoring time</td> +<td></td> +</tr> +<tr> +<td>enum</td> +<td>Which recurrence (up to 4)</td> +<td></td> +</tr> +</tbody> +</table> +<pre class="uc-code-block"><code>Bladder is the data set that appears most commonly in the literature. 
It uses only the 85 subjects with nonzero follow-up who were assigned to either thiotepa or placebo, and only the first four recurrences for any patient. The status variable is 1 for recurrence and 0 for everything else (including death for any reason). The data set is laid out in the competing risks format of the paper by Wei, Lin, and Weissfeld. +</code></pre> +<pre class="uc-code-block"><code>Bladder1 is the full data set from the study. 
It contains all three treatment arms and all recurrences for 118 subjects; the maximum observed number of recurrences is 9. +</code></pre> +<pre class="uc-code-block"><code>Bladder2 uses the same subset of subjects as bladder, but formatted in the (start, stop] or Andersen-Gill style. Note that in transforming from the WLW (Wei, Lin, and Weissfeld) to the AG style data set there is a quite common programming mistake that leads to extra follow-up time for 12 subjects: all those with follow-up beyond their 4th recurrence. 
This "follow-up" is a side effect of throwing away all events after the fourth while retaining the last follow-up time variable from the original data. The bladder2 data set found here does not make this mistake, but some analyses in the literature have done so; the mistake adds a small amount of immortal time bias and shrinks the fitted coefficients towards zero. +</code><div class="uc-code-block-copy-button-wrapper"> \ No newline at end of file