Archive Page
Gastrointestinal Archive










SLIDES & TRANSCRIPTS
Tuesday, March 6

Clinical Trial Designs for Target-Directed Therapies: Clinical Trial Designs for Target-Directed Therapies
Gary L. Rosner, ScD

Slide 1: Statistical Issues In Study Design

I would like to thank the organizers, as well, for inviting me. It has been a very good learning experience. I am a statistician and I am going to talk about some statistical issues that relate to designing studies where you have particular molecular targets that you have identified.
You think that your drug is going to affect these, and now you want to go and do a study, and these are some of the considerations that you need to address in your study design.

TOP

Slide 2: Molecular Markers/Targets

In particular, one needs to consider how you are going to measure the effect, obviously, and I will talk more about that.
As Rich very well delineated, there are different ways in which we might use the term surrogate, but ultimately the challenge is to show that, whatever effect you have on a particular target, correlates with an overall clinical benefit.

TOP

Slide 3: Molecular Targets

So, we have heard lots today about different possible targets that have been identified in terms of the initiation, in terms of angiogenesis, metastasis and some of the different receptors and targets that have been identified.

TOP

Slide 4: Heterogeneity

The first thing, though -- we have also heard a lot about heterogeneity, but a lot of different types of heterogeneity.
What do we mean by heterogeneity? One could say that these two slides, perhaps the overall fraction of the area that is labeled red, suppose that is about the same in the two. In this case, it is pretty well spread out across the whole slide. Here, it is quite focused to just one half. Is that an example of heterogeneity that we need to consider? Well, if we are doing simple biopsies, then obviously it matters, because on the one hand, having a transect this way will still give us a representative sample in the sense that the distribution of the marker of the target within that core is similar to the distribution in the whole slide. However, here is depends upon whether one goes across this way or goes vertically. So, that is an example of heterogeneity that we perhaps need to consider. That may argue for doing more than one biopsy.

TOP

Slide 5: Within-tumor

There is also heterogeneity within a tumor. Patterns change across tumors. So, a slide taken from this part of the tumor may display this sort of variation in how the targets are distributed, whereas a later section may have a different distribution.

TOP

Slide 6: Measuring Hypoxia

As an example, I am going to talk about some work that I did with some colleagues at the North Carolina State University College of Veterinary Medicine and also the University of North Carolina. I was at Duke at the time, so we had all triangle universities involved. Here, we are measuring hypoxia. One of the advantages of collaborating with people at NC State in the vet school is, we had access to pet dogs who came down with cancer. The dogs were brought in by the owners for treatment. We were able to excise tumors, but also do some clinical studies on these dogs.
In particular, here we were studying the distribution of hypoxia within canine sarcomas. Jim Raleigh, at the University of North Carolina, had developed several different immunohistochemical markers for hypoxia. Hypoxia is of interest because, as I think has been said already, hypoxia is associated with resistance. Hypoxic cells are resistant to radiation. The question is how common is hypoxia in spontaneously-occurring tumors and how well can we estimate the fraction of cells that are hypoxic within a tumor.

TOP

Slide 7: Biopsy Study

For this particular study, we had seven tumors. We had two pretreatment biopsies within each tumor. Each biopsy was then sectioned and we could evaluate the sections with respect to the immunohistochemical marker, dye, or staining, for hypoxia.
So, we could look at within-biopsy variation in percent of cells that are hypoxic. We could also look at between biopsy variation, and also we can look at differences between the different tumors, the different dogs.

TOP

Slide 8: Methods

As the method for this study, what we did is, we took all the available data for a given biopsy and assumed that that was, in fact, the true percent of cells that were hypoxic within that biopsy. As you see, it ranged from 11 to 27 slides per biopsy. The median was 19.

TOP

Slide 9: Tumor Sections

Then, we would select at random either one slide per biopsy, two slides per biopsy, or four, as separate simulation studies, estimate the fraction hypoxic from either the one, the two or the four slides, and compare it to what we were calling the true state of hypoxia for that particular biopsy.
The question is, how frequently were we within plus or minus .05 of the true fraction hypoxic. Could we do that well with just taking one slide per biopsy? Do we need to use two or do we need to use four slides per biopsy? So, we sectioned off the tumors into slides, and this is an example slide. Within each slide, we would place a grid in the microscope eyepiece. Then we could measure hypoxia within each of the little grids.

TOP

Slide 10: Within-tumor Variability

What we see here are, for three dogs, that across the slides, there was some variability in hypoxic fraction. The variation, in fact, is higher if the frequency or the percent of cells that are hypoxic is higher.
If a tumor or a biopsy seems to have relatively little hypoxia, there is not as much variation as when there is more hypoxia.

TOP

Slide 11: Variance Increases

This is another picture that shows that, where the variance increases with the mean labeled fraction within the biopsy.

TOP

Slide 12: Fraction Within 0.05 By Number of Biopsies

So, this is something we need to consider. How many slides we take per biopsy depends to some extent on what the frequency is of whatever we are trying to measure. The greater the percent, it is going to be more variable. So, to have as precise a measure we are going to need more slides.

TOP

Slide 13: Study Results

The bottom line from this study, though, is that if we took four slides per biopsy, we were generally, 90 percent of the time, within plus or minus five percent. If we only looked at one slide, we did okay, as I said before, when it was relatively low, say 10 percent hypoxic. For the group of tumors that had much higher levels of hypoxia, one or two slides per biopsy didn't do such a great job.

TOP

Slide 14: Conclusions

This repeats what I just said.

TOP

Slide 15: Designing the Study

When we are designing a study, we need to think, what is the objective. Is the objective estimation? Is the objective prediction? Do we want to predict, for some future patient, something about the marker, or are we just trying to estimate in a cross section of the population?

TOP

Slide 16: Sources of Variation

Well, there are different sources of variation, as I indicated before, or as Emily also indicated. We have the within tumor or lesion heterogeneity. We also have between tumors but within patients, if there are multiple metastatic lesions. We need to consider that source of variations. There is also between patient heterogeneity.
Then the measurement methodology itself may be another source of variation or imprecision.

TOP

Slide 17: Measure

How we measure or how we count, if we use an immunohistochemical marker, also matters. Again, going back to the sarcomas, the canine sarcomas, we had two choices. If we had a section that we prepared into four slides, then do we want to sum up the total number of cells that were labeled across all four, and then divide that by the total number of cells counted, or do we want to average the fractions for each slide, and call that the measure of hypoxic fraction in this example. Similarly, do we want to go through and correct? Look at the slides, figure out the slides, figure out which of the cells are obviously not tumor cells, which are the ones that are tumor cells, and only divide our labeled fraction by dividing by the number of tumor cells. We can see that it has an effect on variation.

TOP

Slide 18: Between-biopsy and Within-biopsy

These show the two or however many biopsies that were available per dog for all seven dogs. You can see there is quite a bit of heterogeneity going across the dogs. The separate bars correspond to a biopsy, and then these box plots show the heterogeneity within a particular biopsy. That is just using for a denominator all the cells that were in the slide.

TOP

Slide 19: Variation

Here is a close up that will show it a little bit better. If we don't correct -- that is, we include all cells for this particular tumor -- these two biopsies are that far apart. If we correct, however, notice that the medians are much closer to each other. So, this has an effect on variability. Similarly, over here, uncorrected and corrected, the medians are much closer to each other across the biopsies.

TOP

Slide 20: Design Considerations

When we want to design a study, do we want to estimate with some pre-specified precision, say, the fraction labeled or even an effect that is a change, because of the treatment? Especially after what Emily Bergsland just said, there is a cost involved. There is an obvious cost in terms of the preparation of the slides, in terms of the patient and so on. Another cost that we need to consider is whether or not patients will agree to participate in this study. If taking a pretreatment biopsy is not something that is done routinely and is not necessarily a part of the patient's therapy, then that may have a very high cost associated with it, and perhaps we need to incorporate these kinds of costs when we design our studies. If we are evaluating two competing designs, we should probably go with the one that has less cost for the same precision, or at least weighs cost against precision.

TOP

Slide 21: Design Decisions

So, our decisions when designing a study are to choose the number of patients, the number of samples per patient, and the timing of the samples.

TOP

Slide 22: Utility Function

A simple way of incorporating cost is to take the precision of whatever you are trying to estimate -- say a treatment effect -- this should be minus the cost. So, you have a cost per patient times the number of patients. Then, for each patient you have a certain number of slides or biopsies times the cost.
If you are incorporating pretreatment biopsies, that cost may be much higher, or would be much higher, than if you are just going to go in and take samples at the time that the patient goes in for resection.

TOP

Slide 23: One Biopsy Per Subject

A little exercise, another simulation exercise, to kind of show how variation or heterogeneity impacts on clinical trial design, is the following: We will assume a particular model. The particular model would say that, for each individual, there is a distribution of what percent of the tumors are going to be labeled. So, there is a population from which we pick tumors that have different percent label. Then, for an individual, there is a distribution within that tumor of what fraction is labeled. So, the biopsies are random samples from, say, within an individual, but the individual is a random sample from this larger population, two sources of heterogeneity. We want to compare a trial where we are looking at a before and after sample to measure treatment effect, to a trial where we just randomize patients, some get the treatment, some not, and just look at the difference between these two groups at time of surgery. So, there is no pretreatment biopsy.

TOP

Slide 24: Between (BAR) Within

Just showing this heterogeneity, the bar graph, the histogram, shows what the population distribution was. There is, on average, 10 percent fraction here.
The individual densities, the lines, show within individual variation for four individuals chosen at random from this population. You can see that, even though the population is about 10 percent, there is heterogeneity between patients. This patient has a higher percent exhibiting the marker than this one. The variation is also higher for this patient than for this one.

TOP

Slide 25: Pre-Treatment Biopsy

How does this affect the clinical trial? Well, the power for the two types of designs, or within-subject design versus a between-patient or between-group design, will get to be about the same if the within-tumor or within-biopsy or within-patient variation is the same or larger as the between variation in this particular setting. The reason is that, as the variation increases, the correlation within a tumor, the correlation of these samples, decreases. If the correlation gets very small, you don't derive much benefit from a within-individual test.

TOP

Slide 26: Example

So, again, in this example we have 10 percent of the cells assumed positive. I also incorporated a threshold which just said if, when I generated this number at random, if it was below .01, it is below our level where we can detect it, so we will call it a zero. At random, 10 percent of the cells or sections or whatever within these simulated biopsies, were going to be zero. So, if you are measuring something like hypoxia, maybe some of the sections could be near a blood vessel. So, they would exhibit no hypoxia.
We are comparing here 20 patients within subject, so before and after clinical trial, to a 40-patient between-patient study. Just for simplicity, I am going to take one biopsy at each time. So, for the randomized trial it is one biopsy, say, at surgery, after treatment. For the within patient, it is a biopsy before and a biopsy after, and you just look at the difference.

TOP

Slide 27: Results

I have taken two situations here corresponding to roughly similar between-patient variation and within-patient variation. In this case, the between-patient variation is much less than the within-patient variation. Here, the within is smaller than between.
You see that, in general, for these two situations, both studies have roughly the same power to detect the difference, and the difference that I did in the simulation was just an absolute number of adding five percent to the fraction exhibiting whatever it is that you want, whatever you are trying to effect.
It is only in this situation where the within variation is less than the between, where you see the pretreatment biopsy improving the power.

TOP

Slide 28: Results

If the population average increases to 50 percent, then there is going to be much more heterogeneity. We saw that before with the hypoxia.
The tumors that had relatively little hypoxia, there was very little variation within a biopsy. The tumors that had more, 30 to 40 percent hypoxic fraction, had more variability. If the population average is now 50 percent, regardless of whether the between or within variation, one is larger than the other, you have very little power for the same sample size. In fact, even going up to 200 patients you still don't have 50 percent power in this particular example.
So, we need to know, when designing a study, how heterogeneous this quantity is, as well as what the mean fraction is.

TOP

Slide 29: Relative Cost Factors

Similarly, I think we need to incorporate costs in some way.

TOP

Slide 30: Summary

In summary, taking, in certain situations, a randomized trial that is only looking at an after treatment biopsy but comparing two groups may be a viable and a reasonable way to design a study, if you want to evaluate whether or not a particular marker is affected by a certain agent, a new compound. Taking multiple measurements per subject may help. Knowing something about the distribution of these things within the population of tumors, between lesions within patients and between patients, I think, is critical before we can really figure out what is the best design. Thank you.

TOP