OSA's Digital Library

Biomedical Optics Express

Biomedical Optics Express

  • Editor: Joseph A. Izatt
  • Vol. 4, Iss. 7 — Jul. 1, 2013
  • pp: 1104–1118
« Show journal navigation

Direct identification of breast cancer pathologies using blind separation of label-free localized reflectance measurements

Alma Eguizabal, Ashley M. Laughney, Pilar Beatriz García-Allende, Venkataramanan Krishnaswamy, Wendy A. Wells, Keith D. Paulsen, Brian W. Pogue, Jose M. Lopez-Higuera, and Olga M. Conde  »View Author Affiliations


Biomedical Optics Express, Vol. 4, Issue 7, pp. 1104-1118 (2013)
http://dx.doi.org/10.1364/BOE.4.001104


View Full Text Article

Acrobat PDF (3964 KB)





Browse Journals / Lookup Meetings

Browse by Journal and Year


   


Lookup Conference Papers

Close Browse Journals / Lookup Meetings

Article Tools

Share
Citations

Abstract

Breast tumors are blindly identified using Principal (PCA) and Independent Component Analysis (ICA) of localized reflectance measurements. No assumption of a particular theoretical model for the reflectance needs to be made, while the resulting features are proven to have discriminative power of breast pathologies. Normal, benign and malignant breast tissue types in lumpectomy specimens were imaged ex vivo and a surgeon-guided calibration of the system is proposed to overcome the limitations of the blind analysis. A simple, fast and linear classifier has been proposed where no training information is required for the diagnosis. A set of 29 breast tissue specimens have been diagnosed with a sensitivity of 96% and specificity of 95% when discriminating benign from malignant pathologies. The proposed hybrid combination PCA-ICA enhanced diagnostic discrimination, providing tumor probability maps, and intermediate PCA parameters reflected tissue optical properties.

© 2013 OSA

1. Introduction

Breast cancer continues to be the most diagnosed cancer among women, comprising 23% of all female cancers. Non-invasive small lesions, detected at an early-stage, however, can be treated successfully with breast conserving therapy (BCT), which includes local tumor excision followed by moderate-dose radiation therapy [1

1. World Health Organization (2008), http://www.who.int/en/

]. Early invasive breast cancers (stage I and stage II) have, however, a high risk of reoccurrence when residual disease is left at or near the cut edge. In fact, BCT has been demonstrated to be equally effective as mastectomy only when no residual disease is left on margins [2

2. F. Fitzal, O. Riedl, and R. Jakesz, “Recent developments in breast-conserving surgery for breast cancer patients,” Langenbecks Arch. Surg. 394(4), 591–609 (2009). [CrossRef] [PubMed]

], thereby minimizing the need for more a more radical therapy like mastectomy. Despite its therapeutic predictive value, most studies report high variability in the number of patients treated with BCT with residual disease, demonstrating a lack of standardization for margin delineation [3

3. R. G. Pleijhuis, M. Graafland, J. de Vries, J. Bart, J. S. de Jong, and G. M. van Dam, “Obtaining adequate surgical margins in breast-conserving therapy for patients with early-stage breast cancer: current modalities and future directions,” Ann. Surg. Oncol. 16(10), 2717–2730 (2009). [CrossRef] [PubMed]

].

Light scattering spectroscopy has been applied broadly to identify residual disease in resected breast tissues by detecting changes in the scattering spectrum induced by morphological variations in the size and number of density of cells and the tissue extra-cellular matrix [4

4. S. Srinivasan, B. W. Pogue, S. Jiang, H. Dehghani, C. Kogel, S. Soho, J. J. Gibson, T. D. Tosteson, S. P. Poplack, and K. D. Paulsen, “Interpreting hemoglobin and water concentration, oxygen saturation, and scattering measured in vivo by near-infrared breast tomography,” Proc. Natl. Acad. Sci. U.S.A. 100(21), 12349–12354 (2003). [CrossRef] [PubMed]

]. Natural heterogeneity in light scattering from tissue morphology has been observed and its spatial distribution can be used to improve discrimination between tissue subtypes [5

5. A. M. Laughney, V. Krishnaswamy, E. J. Rizzo, M. C. Schwab, R. J. Barth, B. W. Pogue, K. D. Paulsen, and W. A. Wells, “Scatter spectroscopic imaging distinguishes between breast pathologies in tissues relevant to surgical margin assessment,” Clin. Cancer Res. 18(22), 6315–6325 (2012). [CrossRef] [PubMed]

]. Consequently, the scanning spectroscopy system demonstrated in [6

6. V. Krishnaswamy, P. J. Hoopes, K. S. Samkoe, J. A. O’Hara, T. Hasan, and B. W. Pogue, “Quantitative imaging of scattering changes associated with epithelial proliferation, necrosis, and fibrosis in tumors using microsampling reflectance spectroscopy,” J. Biomed. Opt. 14(1), 014004 (2009). [CrossRef] [PubMed]

] has been designed to be maximally sensitive to elastic scattering, although some partial coupling with hemoglobin absorption has been observed. Signal localization is employed in the illumination and detection paths to preserve the weakly scattered spectrum. Optical properties, namely the reduced scattering and absorption coefficients, are traditionally parameterized according to theoretical models of light scattering in turbid media. These models are valid when specific, physical conditions are met in the data acquisition geometry [7

7. S. C. Kanick, H. J. C. M. Sterenborg, and A. Amelink, “Empirical model of the photon path length for a single fiber reflectance spectroscopy device,” Opt. Express 17(2), 860–871 (2009). [CrossRef] [PubMed]

]. The full accomplishment of these conditions is sometimes impossible to fulfill, yielding uncertainty in the separation of absorption and scattering signatures. Typically, analytical solution for the problem of diffuse reflectance from turbid media such as biological tissues only exists for idealized systems, like a point source in a semi-infinite medium [8

8. G. Zonios and A. Dimou, “Modeling diffuse reflectance from homogeneous semi-infinite turbid media for biological tissue applications: a Monte Carlo study,” Biomed. Opt. Express 2(12), 3284–3294 (2011). [CrossRef] [PubMed]

]. Models assume light incidence on an optically homogenous medium, which is also only approximate for biological tissues. Furthermore, single-fiber reflectance measurements do not accurately recover the photon pathlength, limiting absolute quantification of optical parameters [9

9. S. L. Jacques and S. Prahl, Oregon Medical Laser Center (2010).

]. Consequently, the existence of a model-free approach would be a great asset.

Blind Signal Separation (BSS) is a set of signal processing techniques able to decouple information arising from multiple sources. Consequently, they can be employed to decouple the information generated by absorption and scattering in tissue for unique acquisition geometries. These methods have been extensively used for removing interference and noise or for feature extraction from optical signals [10

10. J. Glatz, N. C. Deliolanis, A. Buehler, D. Razansky, and V. Ntziachristos, “Blind source unmixing in multi-spectral optoacoustic tomography,” Opt. Express 19(4), 3175–3184 (2011). [CrossRef] [PubMed]

13

13. J. Virtanen, T. Noponen, and P. Meriläinen, “Comparison of principal and independent component analysis in removing extracerebral interference from near-infrared spectroscopy signals,” J. Biomed. Opt. 14(5), 054032 (2009). [CrossRef] [PubMed]

]. Principal Component Analysis (PCA) performs a change of basis and finds new uncorrelated projections, while Independent Component Analysis (ICA) creates a new independent feature space, with more statistical separation than uncorrelation [14

14. J. L. Semmlow, Biosignal and Biomedical Image Processing: MATLAB-Based Applications (CRC Press, 2004), Chap. 9.

]. The validity of ICA has been previously demonstrated in diverse scenarios. It has been proven to enhance classification features from mammograms for breast cancer detection [15

15. R. Gallardo-Caballero, C. J. García-Orellana, H. M. González-Velasco, and M. Macías-Macías, “Independent component analysis applied to detection of early breast cancer signs,” in Proceeding of 9th International Work-Conference on Artificial Neural Networks (IWANN, San Sebastian, Spain, 2007), pp. 988–995.

] and also for other cancer types: maps of tumor probability have been extracted from the ICA of RGB fluorescence images taken from the skin [16

16. I. Kopriva and A. Peršin, “Unsupervised decomposition of low-intensity low-dimensional multi-spectral fluorescent images for tumour demarcation,” Med. Image Anal. 13(3), 507–518 (2009). [CrossRef] [PubMed]

]. PCA is typically employed to reduce data dimensionality and to enhance performance of the independent ICA algorithms [17

17. F. Abu-Amara and I. Abdel-Qader, “Detection of breast cancer using independent component analysis,” in Proceedings of IEEE International Conference on Electro/Information Technology (Institute of Electrical and Electronics Engineers, New York, 2007), pp.428–431. [CrossRef]

]. PCA and ICA have been effectively applied to un-mix distinct exogenous fluorophores in multispectral opto-acoustic tomography data [10

10. J. Glatz, N. C. Deliolanis, A. Buehler, D. Razansky, and V. Ntziachristos, “Blind source unmixing in multi-spectral optoacoustic tomography,” Opt. Express 19(4), 3175–3184 (2011). [CrossRef] [PubMed]

] and also for removing or studying blood absorbance from NIRS signals [11

11. I. Schelkanova and V. Toronov, “Independent component analysis of broadband near-infrared spectroscopy data acquired on adult human head,” Biomed. Opt. Express 3(1), 64–74 (2012). [CrossRef] [PubMed]

,12

12. S. Kohno, I. Miyai, A. Seiyama, I. Oda, A. Ishikawa, S. Tsuneishi, T. Amita, and K. Shimizu, “Removal of the skin blood flow artifact in functional near-infrared spectroscopic imaging data through independent component analysis,” J. Biomed. Opt. 12(6), 062111 (2007). [CrossRef] [PubMed]

]. Nevertheless, the overall outperformance of ICA over PCA is still to be proven [13

13. J. Virtanen, T. Noponen, and P. Meriläinen, “Comparison of principal and independent component analysis in removing extracerebral interference from near-infrared spectroscopy signals,” J. Biomed. Opt. 14(5), 054032 (2009). [CrossRef] [PubMed]

].

Here, a feature-extraction method is presented to discriminate benign from malignant pathologies in resected breast tissue and its diagnostic performance is validated according to histology, the diagnostic gold standard. No analytical models are performed to extracted diagnostic components from the scattering spectrum. Instead, PCA is used to transform spectral data into an uncorrelated feature space that reduces data dimensionality and eliminates cross talk between hemoglobin absorption and scattering signatures. Then, ICA optimized by PCA is used to predict the breast tissue subtype. Finally, sign ambiguities associated with the BSS algorithm are solved by a user-guided, soft calibration process.

2. Materials and methods

2.1 Optical imaging data from breast tissues and the modeling of reflectance

Samples of freshly resected breast tissues acquired during breast conserving surgery, were obtained directly from the Department of Pathology at Dartmouth-Hitchcock Medical Center, when there was tissue in excess of that required to make a clinical diagnosis. Tissues were 1-2cm2 with a thickness of 3-5mm. Immediately after each imaging procedure, each sample with formalin-fixed and paraffin embedded, then stained with Hematoxylin and Eosin (H&E) for pathology correlation. 29 resected tissues were imaged and, on each specimen, several regions of interest (ROIs) were further evaluated by a pathologist for precise co-registration with optical maps. In total, 48 different ROIs were identified that were not uniform in size, having diameters from 500 um to 0.2 cm. Tissues were characterized as benign, malignant or adipose, as summarized in Table 1

Table 1. Regions of Interest (ROI) Diagnosed by the Pathologist on the 29 Specimens

table-icon
View This Table
| View All Tables
.

2.2 Multivariate linear analysis and BSS

Principal Component Analysis (PCA) and Independent Component Analysis (ICA) are linear processing techniques characterized by their simplicity and low computational load. Equation (2) shows the linear transformation that describes both processes.
sM×N=WM×M xM×N
(2)
where x comprises the reflectance information per tissue sample measured at N locations (approximately 4000 pixels or observations per sample in this case) and M different spectral bands (being M = 512); W is the mixing matrix that represents the linear operation to be applied on the original data x to provide s, that contains the multivariate data result with the decoupled mixed signals or scores i.e., the representation of the raw data x in the new component space.

2.2.1 Linear mixture of components

PCA and ICA assume linear mixtures. If neperian logarithm is applied on empirical expression in Eq. (1) a linear sum of the reflectance spectra parameters can be defined and then compared with PCA-ICA results, as shown on Eq. (4):
X(λ)=ln(R(λ))= ln(A)-b ln(λ) -ρ CHbT[fO2εHbO2(λ)-(1- fO2)εHb(λ)]= S1σ1(λ)+S2σ2(λ)+S3σ3(λ)
(4)
where X(λ) is the logarithm of reflectance R(λ)where the contributions to the spectra are linearly mixed. Interpreting the general expression, S1, S2 and S3 would be the linear weights that modulate the contribution of each spectral component, whereas each spectral behavior is represented by theσ1(λ),σ2(λ),σ3(λ) functions that could be directly associated with absorption and scattering features. The sum of all contributions would result into the initial reflectance spectrumX(λ).

2.2.2 PCA to uncorrelate components and compress spectral data

A criterion must then be established to decide these few number of maintained uncorrelated components from the initial M = 512 to L. The chosen criterion is to maintain L<M eigenvalues, with a joint variance above a specific threshold, as shown in Eq. (7).
Keptvariance=q=1LD(q,q)q=1MD(q,q)100(%)threshold
(7)
where D(q,q) is the qth eigenvalue of the covariance matrix C.

2.2.3 ICA to identify independent latent factors

Independent Component Analysis (ICA) is also a multivariate linear blind separator that uses higher order statistics, instead of covariance, to extract the new set of linearly unmixed components. Since statistical independence is a stronger condition than uncorrelation, more accurate maps of diagnosis can be obtained. This work assumes that exist malignancy tissue properties that are statistically independent from other tissue types such as normal or adipose. This hypothesis is based on the differences in absorbance and scattering generated by each tissue condition.

All measures of spectral reflectance could be used to discriminate between tissue types, but this is frequently not optimal and always computationally demanding [20

20. A. Hyvärinen, “Fast and robust fixed-point algorithms for independent component analysis,” IEEE Trans. Neural Netw. 10(3), 626–634 (1999). [CrossRef] [PubMed]

]. Here, PCA is proposed to reduce the data dimensionality [21

21. R. Bro, E. Acar, and T. G. Kolda, “Resolving the sign ambiguity in the singular value decomposition,” J. Chemometr. 22(2), 135–140 (2008). [CrossRef]

]. Consequently, the data arising from this pre-processing step can be analyzed with ICA. This is the reason why ICA results cannot be spectrally interpreted as in Eq. (4): the dimension of data now is not the spectral 512 components but the very few PCA pre-processed components. In fact, a similar situation as in Eq. (3) is faced, but now x, i.e. the detectors, are the uncorrelated components maintained after the PCA analysis, and s, i.e. the sources, will be the IC components, which are supposed to be more diagnostically discriminating.

Figure 1
Fig. 1 ICA-PCA analysis process on the reflectance images.
summarizes the whole analysis procedure to obtain the maps of tumor probability: PCA is first applied to the logarithm of the initial reflectance data set, containing 512 images, one per wavelength Fig. 1(a). Then, a few uncorrelated components are maintained Fig. 1(b) and are input into the ICA algorithm. A tumor map probability Fig. 1(c) is computed from the resulting independent components, which, because of the more stronger condition mentioned above, are expected to be more diagnostically relevant and unmixed than the principal components attained in the immediate prior analysis stage. To obtain the independent components, a FastICA algorithm was employed that is based on maximization of the fourth statistical moment, i.e. kurtosis. It is computationally simple, fast and requires little memory space [21

21. R. Bro, E. Acar, and T. G. Kolda, “Resolving the sign ambiguity in the singular value decomposition,” J. Chemometr. 22(2), 135–140 (2008). [CrossRef]

].

2.2.4 Solving ambiguity problems from PCA and ICA

The two linear analyses, PCA and ICA, jointly described as in Eq. (3), have another point in common that needs to be emphasized: sign ambiguity. The resulting component s could be multiplied by −1 and the solution of Eq. (3) would be identical and correct. Moreover, ICA has another ambiguity on the order of result components.

PCA analysis is based on the extraction of the singular value decomposition matrix (SVD) and there is mathematically no way to avoid this sign ambiguity arising from a multiplicative term such as the pair of singular vectors [21

21. R. Bro, E. Acar, and T. G. Kolda, “Resolving the sign ambiguity in the singular value decomposition,” J. Chemometr. 22(2), 135–140 (2008). [CrossRef]

]. In ICA, the variance of the independent components cannot be determined [20

20. A. Hyvärinen, “Fast and robust fixed-point algorithms for independent component analysis,” IEEE Trans. Neural Netw. 10(3), 626–634 (1999). [CrossRef] [PubMed]

], so the magnitudes of the independent components may be fixed, but this still leaves the ambiguity of the sign. Several strategies have been tested to deal with the ambiguity problems of BSS analysis. The sign ambiguity of PCA can be solved with the spectral interpretation of the unmixed signals, σn(λ). However, ICA in this case does not permit a spectral interpretation. Concerning order ambiguity, while PCA components are ordered by variance, the intrinsic order ambiguity of ICA impedes a discriminating rank of independent components [20

20. A. Hyvärinen, “Fast and robust fixed-point algorithms for independent component analysis,” IEEE Trans. Neural Netw. 10(3), 626–634 (1999). [CrossRef] [PubMed]

].

FastICA is a recursive algorithm that starts with an initial guess. If this initial guess is not fixed, the algorithm begins with a random matrix resulting in different signs and orders of the output signals, even if it initiates from the same set of measurements. The W matrix resulting from PCA, i.e. the uncorrelated coefficients, is proposed as the initial seed to limit this ambiguity effect. Additionally, surgeon-guidance is proposed to compensate for this ambiguity. Visual inspection of results reveals that one significant independent component is sufficient to distinguish benign from malignant pathologies. The surgeon could guide selection of the significant component or alternatively, the significant component could be identified by cross-correlating a digital photograph of the sample with its spectrally-derived ICA parameters, mimicking the surgeon’s viewpoint. Even though sign ambiguities in the magnitude of the selected independent component still would induce an error in the tissue category assignment. To this end, a calibration method is employed in which the user, ultimately the surgeon, specifies a set of known pixels, i.e. obviously malignant tissue at the center of the lesion. Informed with this initial information, ICA then provides a map of tumor extent.

2.3 Well-known point’s strategy

The requirement for the sign ambiguity in ICA to be corrected and the approach to implement this correction vary among applications [20

20. A. Hyvärinen, “Fast and robust fixed-point algorithms for independent component analysis,” IEEE Trans. Neural Netw. 10(3), 626–634 (1999). [CrossRef] [PubMed]

], although a majority of them are based on the employment of supervised classifiers after the ICA process. In the present tissue diagnosis application, different pairs of tissue regions (adipose-benign, adipose-malignant, benign-malignant) become well separated by the PCA-ICA combination but the sign ambiguity introduces a constraint in the tissue category assignment to perform an absolutely blind selection. In the validation against the pathologist-based diagnosis, it is precisely the sign of the magnitude of the selected independent component the one that differentiates between tissue diagnoses. A pair of two possibilities of diagnosis can be considered corresponding to positive sign and negative sign regions. However, the same sign is not always associated to the same pathology for different patients.

The proposed procedure however needs some a priori knowledge about the sign that is associated per tissue type, since this information is required to specify the associated tissue category (malignant or non-malignant) in the final guidance map. Taking advance of both their experience and the information provided by pre-interventional techniques, surgeons are able to clearly identify the tumor center and healthy tissue. The main difficulty they face is the accurate delineation of the malignant area far from the center. This is the point where the proposed guidance map would be of great interest. Once surgeons are asked to locate malignant and non-malignant centers, these points will work as calibration points for the algorithm identifying the actual sign for malignancy regions. In order to emulate this surgeon selection, 25 pixels on each ROI have been selected as “well-known”, contrasted points to be certainly diagnosed. Then a detection mask can be easily created. This process is summarized in Fig. 2
Fig. 2 ICA discriminant results with approximate scale bar: (a) and (b) are both correct solutions due to sign ambiguity, where the magenta (strongest tone) are values with positive signs. When surgeon selects some points of the image where the pathology is sure (circles on (c)) the region of malignancy (striped ROI) can be defined as a positive (magenta) region, avoiding the sign ambiguity.
.

3. Results and discussion

The goal of this paper is to design a blind data analysis, i.e. without model fitting, to segment tumor from normal tissues in lumpectomy specimens using localized, measured broadband reflectance. This blind analysis is designed to discriminate between areas in a single tissue sample and not between samples. This is acceptable for margin detection because the success of BCT is measured by accurate tumor delineation within each patient. Results will compare the performance obtained by PCA, PCA-ICA and the extraction of optical parameters according to an empirical approximation to Mie theory [18

18. A. M. Laughney, V. Krishnaswamy, P. B. Garcia-Allende, O. M. Conde, W. A. Wells, K. D. Paulsen, and B. W. Pogue, “Automated classification of breast pathology using local measures of broadband reflectance,” J. Biomed. Opt. 15(6), 066019 (2010). [CrossRef] [PubMed]

]. The metrics considered to address the performance are the probability of detection and false alarm (sensitivity and specificity) of PCA-ICA and PCA itself.

3.1 PCA results: uncorrelation has less strong diagnostic ability than independency

For blind signal separation, PCA is applied to reduce the dimensionality of broadband reflectance data, to estimate the number of components used to inform a diagnosis, and to analyze if their diagnostic relevance.

When the A matrix of PCA coefficients of Eq. (3) is qualitatively observed, the first principal coefficient displays a constant spectral tendency while the second shows exponential or negative logarithm behavior. Figure 4
Fig. 4 Mean spectral variations of PC1, PC2 and PC3 coefficients on matrix (A) for the whole number of breast samples, where “prop.” means proportional.
represents the mean of the first three principal coefficients along the 29 samples. The optical system is optimized not to detect absorption, but just scattering [18

18. A. M. Laughney, V. Krishnaswamy, P. B. Garcia-Allende, O. M. Conde, W. A. Wells, K. D. Paulsen, and B. W. Pogue, “Automated classification of breast pathology using local measures of broadband reflectance,” J. Biomed. Opt. 15(6), 066019 (2010). [CrossRef] [PubMed]

]. Nonetheless the third principal coefficient exhibits high correlation with absorption by hemoglobin.

Considering the Mie linear approximation of reflectance as noted in Eq. (4), the similarity with PCA results is found as stated by Eq. (8):
ln(R(λ))= pc1- pc2 ln(λ) -pc2 K(λ)
(8)
Being K(λ) the exponent of hemoglobin absorbance on the model and pcn would be the PCA scores, as defined in Eq. (4).

If this supposition of likeness was right, similarity between PCA scores and model-based parameters should be found. Table 2

Table 2. Correlation Study To Determine the Hemoglobin Presence on Each Principal Component*

table-icon
View This Table
| View All Tables
shows the correlation between the mean PCA scores and optical parameters extracted from the empirical approximation fitting given by Eq. (1), to assess the contribution of scattering power and hemoglobin absorption to each score.

Although a high correlation with Mie power scattering is found on PC2 (Fig. 4), and hemoglobin absorption is usually collected on PC3 (Fig. 4), this relationship does not necessarily define PC2 and PC3 as scattering power and hemoglobin, like in a conventional model fitting extraction. However, some similarities are found on the behavior of the statistical features (PCA scores) and the optical features (scattering and absorption from model) which may suggest that BSS analysis accounts for physical variation of parameters of the tissue. Figure 5
Fig. 5 Blind analysis and physical properties of scattering with an approximation of the scale bar: (a) scattering power map from model fitting; (b) score of PC2; (c) scatter plot showing their correlation.
shows the maps of the PC2 scores (Fig. 5(a)) for a specific tissue sample when compared with the scattering power map (Fig. 5(b)) obtained from Eq. (1). High correlation between both maps can be observed that is shown also in the associated scatter plot (Fig. 5(c)). Figure 6
Fig. 6 Blind analysis and physical properties of absorption: (a) digital photograph with blood points in black circles and approximate scale bars, (b) Coefficient, (eigenvector) of the 3th principal component, with its spectral variation, (c) PC3 score.
represents the influence of the hemoglobin in the scores of PC3. In the digital photograph of the sample (Fig. 6(a)) some blood pools can be observed. The spectral variation of PC3 (Fig. 6(b)) shows similarities with the hemoglobin spectrum and the map of scores of the PC3 exhibits high values in the areas where the blood pools are located (Fig. 6(c)).

ICA was then applied to the three most significant PCA components, determined by kept variance, to extract the independent maps of each sample, for improved discrimination. These independent features were used to classify the tissues, and results were compared with the uncorrelated features to check if the independency stronger statistical condition translates into better classification accuracy.

3.2 ICA for the extraction of independent maps

While PCA ambiguity is easy to solve through a correlation with the spectral signatures of tissue chromophores, FastICA is an iterative algorithm that causes two types of ambiguities and such a study is not so straightforward. Because of this, and as mentioned above, PCA matrix (coefficients) is used as the initial seed for the ICA algorithm. Even under this premise, it was not possible to deal with these ambiguities analytically.

By visual inspection it seems easy to determine which component is most discriminating, but automation of this analysis is desirable. The digital photograph of each tissue sample also provides useful information as it mimics surgeons vision. Correlation with the digital photograph is proposed as a fast solution to determine which ICA score is more interesting for diagnostic purposes. This procedure could emulate the surgeon behavior. Figure 7
Fig. 7 IC selection based on correlation with the digital photograph of a malignant sample. Circles correspond to the areas identified as malignant by the pathologist, scale bars are approximated. (a) Digital photograph of the breast specimen; (b) last IC with its correlation and probability of detection; (c) penultimate IC with its correlation and probability of detection; (d) H&E section of the sample.
shows one of the tissue samples and the probability of detection (Pd) and degree of correlation (R) when compared with the digital photograph of the tissue (Fig. 7(a)) in case of selection of the last IC (Fig. 7(b)) or the penultimate IC (Fig. 7(c)). The H&E section is also shown (Fig. 7(d)) for visualization purposes. In this sample, the last IC exhibits the highest correlation with the digital photograph and also achieves the highest probability of detection when results are validated against the ROI’s information provided by the pathologist.

Then, after the best ICA score for classification purposes is selected, and according with to the values in the ICA space, a probability of tumor can be calculated. Figure 8 shows the results for 4 different samples with diverse pathologies malignant (purple), non-malignant (white) and adipose (cyan). The last column shows the images of the map for tumor probability and good agreement can be found by visual inspection when compared with the H&E section and the pathologist ROIs. For a more quantitative assessment, Table 4

Table 4. Comparison Between the Outcomes of Sensitivity and Specificity of a Classifier of Malignancy for the Different BSS Strategies and a Supervised Technique as KNN

table-icon
View This Table
| View All Tables
shows sensitivity and specificity outcomes detecting malignant points and a comparison of strategies to fix and select the diagnostic map. The best results depend on the chosen ICA score, being better when it becomes fixed by visual inspection. The correlation with the digital photograph of the specimen helps in the determination of the best score but its performance is a bit lower than the provided by PCA first score.

4. Conclusions

The feasibility of PCA and ICA to blindly detect and localize breast tissue pathologies is proposed and successfully checked in this paper. To preserve the tissue properties, elastic scattering that requires low optical power is used. The analysis here proposed is designed to discriminate between tissue areas within a single sample, and not between samples, with the ultimate goal of surgeon guidance for Breast Conserving Therapy purposes.

Computationally efficient BSS analysis has been directly applied to 512 optical localized reflectance breast measurements, instead of reflectance model fitting, to readily identify their corresponding cancer pathologies. Reflectance is directly obtained from endogenous tissue properties, mainly scattering from tissue morphology, without injection of contrast agents that require expensive biocompatibility studies and regulatory approval for clinical use.

PCA reduces the dimension of the data set, from the initial 512 spectral bands to just 3-5 uncorrelated components. The latter exhibit significant similarities with the parameters extracted based on an empirical model based on the Mie theory, specifically scattering power and the hemoglobin absorption spectrum. They can additionally be used as classification features by applying a linear threshold. However, the statistical feature of uncorrelation is softer and less significant than statistical independence so ICA has been employed to compare results of classification. Combined PCA-ICA analysis has provided the best significant diagnosis maps with probability of tumor information. Discriminating spectral information, sometimes lost in empirical approximations of light scattering, contributes here to a better tissue type separation.

Sign ambiguities limiting discrimination by ICA have been resolved by selecting some “well-known” points that the surgeon can provide in a real scenario to determine a calibration environment. However, ambiguity arising from the order in which the scores are generated has been a challenge. The selected criterion to confront this ambiguity is to correlate the ICA results with the corresponding digital photograph of the tissue. The best sensitivity-specificity possible attained with ICA is 96%-95% while “photograph correlation for selection” solving proposal yielded 93%-81%. Therefore a loss in the classification is induced if the selection of best score is not optimized. However, both ICA solutions are still better choice than the selection of the second PCA score, which presents 86%-74%.

Furthermore, important correlation between tumor probability and H&E maps is also obtained, which suggest that a future application of the system could be margin delimitation. The goal of this approach is not to diagnose malignancy but to map its extent. During surgery the tumor is already localized, so a seeding of the algorithm by the surgeon is feasible.

Acknowledgments

This work has been supported by the Spanish Government through CYCIT projects DA2TOI (FIS2010-19860), TFS (TEC2010-20224-C02-02) and Alma’s Eguizabal PhD Grant (FPU12/04130) and by Dartmouth College.

References and links

1.

World Health Organization (2008), http://www.who.int/en/

2.

F. Fitzal, O. Riedl, and R. Jakesz, “Recent developments in breast-conserving surgery for breast cancer patients,” Langenbecks Arch. Surg. 394(4), 591–609 (2009). [CrossRef] [PubMed]

3.

R. G. Pleijhuis, M. Graafland, J. de Vries, J. Bart, J. S. de Jong, and G. M. van Dam, “Obtaining adequate surgical margins in breast-conserving therapy for patients with early-stage breast cancer: current modalities and future directions,” Ann. Surg. Oncol. 16(10), 2717–2730 (2009). [CrossRef] [PubMed]

4.

S. Srinivasan, B. W. Pogue, S. Jiang, H. Dehghani, C. Kogel, S. Soho, J. J. Gibson, T. D. Tosteson, S. P. Poplack, and K. D. Paulsen, “Interpreting hemoglobin and water concentration, oxygen saturation, and scattering measured in vivo by near-infrared breast tomography,” Proc. Natl. Acad. Sci. U.S.A. 100(21), 12349–12354 (2003). [CrossRef] [PubMed]

5.

A. M. Laughney, V. Krishnaswamy, E. J. Rizzo, M. C. Schwab, R. J. Barth, B. W. Pogue, K. D. Paulsen, and W. A. Wells, “Scatter spectroscopic imaging distinguishes between breast pathologies in tissues relevant to surgical margin assessment,” Clin. Cancer Res. 18(22), 6315–6325 (2012). [CrossRef] [PubMed]

6.

V. Krishnaswamy, P. J. Hoopes, K. S. Samkoe, J. A. O’Hara, T. Hasan, and B. W. Pogue, “Quantitative imaging of scattering changes associated with epithelial proliferation, necrosis, and fibrosis in tumors using microsampling reflectance spectroscopy,” J. Biomed. Opt. 14(1), 014004 (2009). [CrossRef] [PubMed]

7.

S. C. Kanick, H. J. C. M. Sterenborg, and A. Amelink, “Empirical model of the photon path length for a single fiber reflectance spectroscopy device,” Opt. Express 17(2), 860–871 (2009). [CrossRef] [PubMed]

8.

G. Zonios and A. Dimou, “Modeling diffuse reflectance from homogeneous semi-infinite turbid media for biological tissue applications: a Monte Carlo study,” Biomed. Opt. Express 2(12), 3284–3294 (2011). [CrossRef] [PubMed]

9.

S. L. Jacques and S. Prahl, Oregon Medical Laser Center (2010).

10.

J. Glatz, N. C. Deliolanis, A. Buehler, D. Razansky, and V. Ntziachristos, “Blind source unmixing in multi-spectral optoacoustic tomography,” Opt. Express 19(4), 3175–3184 (2011). [CrossRef] [PubMed]

11.

I. Schelkanova and V. Toronov, “Independent component analysis of broadband near-infrared spectroscopy data acquired on adult human head,” Biomed. Opt. Express 3(1), 64–74 (2012). [CrossRef] [PubMed]

12.

S. Kohno, I. Miyai, A. Seiyama, I. Oda, A. Ishikawa, S. Tsuneishi, T. Amita, and K. Shimizu, “Removal of the skin blood flow artifact in functional near-infrared spectroscopic imaging data through independent component analysis,” J. Biomed. Opt. 12(6), 062111 (2007). [CrossRef] [PubMed]

13.

J. Virtanen, T. Noponen, and P. Meriläinen, “Comparison of principal and independent component analysis in removing extracerebral interference from near-infrared spectroscopy signals,” J. Biomed. Opt. 14(5), 054032 (2009). [CrossRef] [PubMed]

14.

J. L. Semmlow, Biosignal and Biomedical Image Processing: MATLAB-Based Applications (CRC Press, 2004), Chap. 9.

15.

R. Gallardo-Caballero, C. J. García-Orellana, H. M. González-Velasco, and M. Macías-Macías, “Independent component analysis applied to detection of early breast cancer signs,” in Proceeding of 9th International Work-Conference on Artificial Neural Networks (IWANN, San Sebastian, Spain, 2007), pp. 988–995.

16.

I. Kopriva and A. Peršin, “Unsupervised decomposition of low-intensity low-dimensional multi-spectral fluorescent images for tumour demarcation,” Med. Image Anal. 13(3), 507–518 (2009). [CrossRef] [PubMed]

17.

F. Abu-Amara and I. Abdel-Qader, “Detection of breast cancer using independent component analysis,” in Proceedings of IEEE International Conference on Electro/Information Technology (Institute of Electrical and Electronics Engineers, New York, 2007), pp.428–431. [CrossRef]

18.

A. M. Laughney, V. Krishnaswamy, P. B. Garcia-Allende, O. M. Conde, W. A. Wells, K. D. Paulsen, and B. W. Pogue, “Automated classification of breast pathology using local measures of broadband reflectance,” J. Biomed. Opt. 15(6), 066019 (2010). [CrossRef] [PubMed]

19.

A. Hyvärinen and E. Oja, “Independent component analysis: algorithms and applications,” Neural Netw. 13(4-5), 411–430 (2000). [CrossRef] [PubMed]

20.

A. Hyvärinen, “Fast and robust fixed-point algorithms for independent component analysis,” IEEE Trans. Neural Netw. 10(3), 626–634 (1999). [CrossRef] [PubMed]

21.

R. Bro, E. Acar, and T. G. Kolda, “Resolving the sign ambiguity in the singular value decomposition,” J. Chemometr. 22(2), 135–140 (2008). [CrossRef]

OCIS Codes
(170.6510) Medical optics and biotechnology : Spectroscopy, tissue diagnostics
(170.6935) Medical optics and biotechnology : Tissue characterization

ToC Category:
Spectroscopic Diagnostics

History
Original Manuscript: March 27, 2013
Revised Manuscript: May 10, 2013
Manuscript Accepted: May 21, 2013
Published: June 12, 2013

Citation
Alma Eguizabal, Ashley M. Laughney, Pilar Beatriz García-Allende, Venkataramanan Krishnaswamy, Wendy A. Wells, Keith D. Paulsen, Brian W. Pogue, Jose M. Lopez-Higuera, and Olga M. Conde, "Direct identification of breast cancer pathologies using blind separation of label-free localized reflectance measurements," Biomed. Opt. Express 4, 1104-1118 (2013)
http://www.opticsinfobase.org/boe/abstract.cfm?URI=boe-4-7-1104


Sort:  Author  |  Year  |  Journal  |  Reset  

References

  1. World Health Organization (2008), http://www.who.int/en/
  2. F. Fitzal, O. Riedl, and R. Jakesz, “Recent developments in breast-conserving surgery for breast cancer patients,” Langenbecks Arch. Surg.394(4), 591–609 (2009). [CrossRef] [PubMed]
  3. R. G. Pleijhuis, M. Graafland, J. de Vries, J. Bart, J. S. de Jong, and G. M. van Dam, “Obtaining adequate surgical margins in breast-conserving therapy for patients with early-stage breast cancer: current modalities and future directions,” Ann. Surg. Oncol.16(10), 2717–2730 (2009). [CrossRef] [PubMed]
  4. S. Srinivasan, B. W. Pogue, S. Jiang, H. Dehghani, C. Kogel, S. Soho, J. J. Gibson, T. D. Tosteson, S. P. Poplack, and K. D. Paulsen, “Interpreting hemoglobin and water concentration, oxygen saturation, and scattering measured in vivo by near-infrared breast tomography,” Proc. Natl. Acad. Sci. U.S.A.100(21), 12349–12354 (2003). [CrossRef] [PubMed]
  5. A. M. Laughney, V. Krishnaswamy, E. J. Rizzo, M. C. Schwab, R. J. Barth, B. W. Pogue, K. D. Paulsen, and W. A. Wells, “Scatter spectroscopic imaging distinguishes between breast pathologies in tissues relevant to surgical margin assessment,” Clin. Cancer Res.18(22), 6315–6325 (2012). [CrossRef] [PubMed]
  6. V. Krishnaswamy, P. J. Hoopes, K. S. Samkoe, J. A. O’Hara, T. Hasan, and B. W. Pogue, “Quantitative imaging of scattering changes associated with epithelial proliferation, necrosis, and fibrosis in tumors using microsampling reflectance spectroscopy,” J. Biomed. Opt.14(1), 014004 (2009). [CrossRef] [PubMed]
  7. S. C. Kanick, H. J. C. M. Sterenborg, and A. Amelink, “Empirical model of the photon path length for a single fiber reflectance spectroscopy device,” Opt. Express17(2), 860–871 (2009). [CrossRef] [PubMed]
  8. G. Zonios and A. Dimou, “Modeling diffuse reflectance from homogeneous semi-infinite turbid media for biological tissue applications: a Monte Carlo study,” Biomed. Opt. Express2(12), 3284–3294 (2011). [CrossRef] [PubMed]
  9. S. L. Jacques and S. Prahl, Oregon Medical Laser Center (2010).
  10. J. Glatz, N. C. Deliolanis, A. Buehler, D. Razansky, and V. Ntziachristos, “Blind source unmixing in multi-spectral optoacoustic tomography,” Opt. Express19(4), 3175–3184 (2011). [CrossRef] [PubMed]
  11. I. Schelkanova and V. Toronov, “Independent component analysis of broadband near-infrared spectroscopy data acquired on adult human head,” Biomed. Opt. Express3(1), 64–74 (2012). [CrossRef] [PubMed]
  12. S. Kohno, I. Miyai, A. Seiyama, I. Oda, A. Ishikawa, S. Tsuneishi, T. Amita, and K. Shimizu, “Removal of the skin blood flow artifact in functional near-infrared spectroscopic imaging data through independent component analysis,” J. Biomed. Opt.12(6), 062111 (2007). [CrossRef] [PubMed]
  13. J. Virtanen, T. Noponen, and P. Meriläinen, “Comparison of principal and independent component analysis in removing extracerebral interference from near-infrared spectroscopy signals,” J. Biomed. Opt.14(5), 054032 (2009). [CrossRef] [PubMed]
  14. J. L. Semmlow, Biosignal and Biomedical Image Processing: MATLAB-Based Applications (CRC Press, 2004), Chap. 9.
  15. R. Gallardo-Caballero, C. J. García-Orellana, H. M. González-Velasco, and M. Macías-Macías, “Independent component analysis applied to detection of early breast cancer signs,” in Proceeding of 9th International Work-Conference on Artificial Neural Networks (IWANN, San Sebastian, Spain, 2007), pp. 988–995.
  16. I. Kopriva and A. Peršin, “Unsupervised decomposition of low-intensity low-dimensional multi-spectral fluorescent images for tumour demarcation,” Med. Image Anal.13(3), 507–518 (2009). [CrossRef] [PubMed]
  17. F. Abu-Amara and I. Abdel-Qader, “Detection of breast cancer using independent component analysis,” in Proceedings of IEEE International Conference on Electro/Information Technology (Institute of Electrical and Electronics Engineers, New York, 2007), pp.428–431. [CrossRef]
  18. A. M. Laughney, V. Krishnaswamy, P. B. Garcia-Allende, O. M. Conde, W. A. Wells, K. D. Paulsen, and B. W. Pogue, “Automated classification of breast pathology using local measures of broadband reflectance,” J. Biomed. Opt.15(6), 066019 (2010). [CrossRef] [PubMed]
  19. A. Hyvärinen and E. Oja, “Independent component analysis: algorithms and applications,” Neural Netw.13(4-5), 411–430 (2000). [CrossRef] [PubMed]
  20. A. Hyvärinen, “Fast and robust fixed-point algorithms for independent component analysis,” IEEE Trans. Neural Netw.10(3), 626–634 (1999). [CrossRef] [PubMed]
  21. R. Bro, E. Acar, and T. G. Kolda, “Resolving the sign ambiguity in the singular value decomposition,” J. Chemometr.22(2), 135–140 (2008). [CrossRef]

Cited By

Alert me when this paper is cited

OSA is able to provide readers links to articles that cite this paper by participating in CrossRef's Cited-By Linking service. CrossRef includes content from more than 3000 publishers and societies. In addition to listing OSA journal articles that cite this paper, citing articles from other participating publishers will also be listed.


« Previous Article  |  Next Article »

OSA is a member of CrossRef.

CrossCheck Deposited