Introduction
EEG artifact is defined as any electrical potential that is not produced by the brain, e.g., eye movement or head movement, environmental sources, the TV, electric motors, etc. The accepted standards for selecting artifact free EEG were initially established by Hans Berger in the 1920s-1930s [1-3] and have been essentially unchanged since [4]. The standards were: 1-good recording “hygiene” and, 2-visual inspection to test and verify that the parts of the EEG recording used for assessment and analysis had no artifact [4-7]. Since the introduction of digital computers in the 1950s and 1960s [8-11] the standards for the selection of artifact free segments of an EEG recording have remained the same except with the addition of computational artifact detection methods based on the known physics of the EEG which arises essentially inside a sphere versus the physics of external artifacts such as heart beat, muscle, head movements, environmental noise, electrical motors, etc. that are external to the brain. Actually, external artifact sources are usually many times larger than EEG and have different frequencies and phases differences than EEG. This is why since about the 1970s most QEEG software uses the physiological artifact free parts of an EEG record to identify significant deviations and thereby delete or not select artifact for analysis [12-16]. Human scalp recorded EEG waves are unique and special and after viewing 10 or 20 examples one can recognize artifact because artifact is visually obvious and is easy to teach to recognize because external electrical sources (i.e., artifact) can be quickly deleted resulting in sufficient artifact free EEG digital samples to routinely achieve 0.9 and higher test re-test reliability. Importantly, artifact free EEG segments are the standard in clinical neurology for the evaluation of epilepsy and various pathologies [17]. This topic is also important in court cases where the invalidity of a replacement by a distortion of a patient’s original digital time series with artifact can be proven.
Today computer routines are commonly used to detect and delete artifact segments or parts of an EEG recording in a few seconds and mark the EEG traces for visual verification and provide test re-test reliability statistics of the artifact free parts of the recording [12]. The vast majority of the approximately 170,000 EEG studies cited in the National Library of Medicine database (Pubmed) (6/7/20) are Quantitative EEG (QEEG) studies from clinical patients using the established method of deletion of non-brain sources of artifact by deleting artifact and then using test re-test sampling statistics of the artifact free part of the recording to test and verify that the selected samples are repeatable and representative of the patient’s EEG recording without any replacement with an artificial substitute of the original measures.
EEG sources are hubs or clusters of synchronous synaptic potentials organized in brain networks that have different time delays between hubs (conduction velocities, synaptic rise times, synaptic delays in loops, etc.) and therefore exhibit significant and reproducible phase differences at the scalp surface [5-7,18-22]. Over the last 50 years a common practice in clinical EEG is to use a single common recording reference, e.g., an ear or mastoid or scalp electrode, etc. and then calculate the phase differences between scalp electrodes using invertible algebraic operations. Invertibility is defined by a one-to-one mapping of the domain of a function (e.g., electrode locations) to the range of a function (e.g., amplitude and phase differences, etc.). For example, if the instantaneous phase difference between scalp electrode A and the common reference R=60 degrees and the instantaneous phase difference between scalp electrode B and the common reference R=45 degrees, then the phase difference at an instant of time is (A-R)- (B-R)=A-B=60–45=15 degree phase difference.In other words when a common reference is used in a differential amplifier, which is used in the vast majority of EEG studies cited in the National Library of Medicine database, then the phase or time differences due to the common reference electrode ‘R’ cancels leaving the time difference between the two active scalp leads as the physiologically accurate measure of phase difference, i.e., 15 degree instantaneous phase difference between scalp locations A and B. Also phase differences are not recoverable when using an average reference or when the recording is a bipolar montage and not a common reference [7,17]. Clinical Neurology and evaluations of epilepsy depend on accurate measures of phase differences as well as the direction and time delays. For example, a patient with focal left frontal lobe epilepsy that spreads or travels across the corpus callosum to the other hemisphere and/ or to other parts of the brain like the parietal lobe is part or the clinical presentation. Therefore, phase difference in a patient’s EEG is important information in the evaluation of a patient with epilepsy and other neurological disorders.
The property of invertibility is important because it allows for inverse solutions of the 3-dimensional sources of the EEG and based on spatial information related to phase or time differences between separated groups of neurons. This is why a linear filter called Finite Impulse Response (FIR) is used in EEG because of the preservation of phase or time differences in the EEG recording. Infinite Impulse Response filters (IIR) fail to preserve phase and extra steps are needed to recover phase differences with an IIR filter. The mathematics of invertibility allows for algebraic remontaging that can be reversed with no loss of information and there is no distortion of the phase differences present in the original recording [4,6,7,19].
Accurate measures of the scalp electrical potentials and phase differences depends on the physics of the EEG and the use of invertible equations such as the Poisson, Laplace, Green’s equations and Maxwell and Helmholtz equations. Once the mapping of the domain to the range of a function is artificially modified or if nonlinear filtering is used then error is introduced into the equations of physics and also there is a decoupling of the brain from EEG network measures [7,12,23-26]. Unfortunately, recent developments in decomposing the EEG into Independent Components (ICs) and then creating a replacement time series using a smaller set of ICs is a nonlinear filter process that is not invertible and therefore has created an alternate reality by replacing the original voltages by a mathematical construct that lacks mathematical and physiological validity
Independent Components Analysis (ICA) is a solution to the “cocktail party problem” of listening in on one person’s speech in a noisy room. If N voices (sources) are present, at least M microphones (observations) are needed to recover the original signals. This constitutes the square case or M=N that is “invertible” or where one can uniquely determine the input from its output. If there are more sources (neurons) then sensors (microphones) or where M ≠ N then the system is non-invertible. Also, Independent Components Analysis (ICA) is based on three non-physiological assumptions:
1- The sources are independent
2- The sources are non-Gaussian in distribution and,
3- The sources are spatially fixed.
The number of EEG recording electrodes (microphones) is many times smaller than the number of sources. In addition, ICA assumes that the sources (voices) are independent but neurons are organized in clusters or hubs in networks with reciprocal loops and are not independent. Neurons fire with bursts of action potentials and communicate with distant clusters of synchronous neurons by axons and synapse connections with conduction velocities and synaptic delays as a function of the distance between hubs and coordinating structures like the thalamus, cortico-cortical connections, etc. This means that when decomposing the EEG time series into a Set of Independent Components (ICS), then the ICs are not sources of the EEG and cannot be used as a replacement of the EEG electrical field of the brain nor to accurately estimate the inverse solution to link 3-dimensional sources inside of the brain to the 2-dimensional scalp electrical field. Instead ICs are mathematical constructs that decompose the electrical field using a noninvertible and nonlinear method that is also physiologically invalid. Another violation of the ICA model is the fact that all of the generators of the human EEG are not non-gaussian.
Simple transforms such as log base 10 and Box Cox or square root are commonly used in QEEG analyses [4,12,13,27,28] to produce Gaussian distributions and many metrics are naturally Gaussian.
Figure 1 is an example of an approximately Gaussian distribution of phase differences and simple transforms are used to better approximate Gaussian if necessary [12,13,29-31] [Figure 1].
Figure 1: An example of an approximate Gaussian distribution of EEG phase differences. In this instance the phase differences are between the Fp1 and Fp2 scalp locations in the delta band for 2 minutes of artifact free EEG selections. Very similar Gaussian distributions are present for other electrode combinations and different frequency bands.
ICA is invertible when the number of independent components equals the number of scalp electrodes or where there is a one-to-one mapping of the domain (ICs) to the range (loadings). However, ICA is not invertible when it is used to reconstruct a replacement time series (ICA-R) from a smaller domain than the original domain and there is no longer a one-to-one mapping. The ICA artifact correction thought to remove artifact such as eye movement and blinks is an example of a non-invertible process. ICA reconstruction of a substitute noninvertible time series is by a three step process:
1- 1st decompose N scalp EEG recordings into N ICs,
2- Eliminate one or more ICs that subjectively appear to have high artifact loadings and,
3- Then with the smaller set of ICs create a substitute ICA-R time series that replaces the original EEG. The 19 channel ICA-R time series replacement of the original EEG recording invalidates all subsequent phase dependent analyses because of the violation of the property of invertibility. In the 1800s Bernard Reimann solved and defined the universal invertibility properties that analytic functions depend on such as Fourier and Hilbert transforms as well as the inverse function theorem and matrix algebra in general.
Figure 2 illustrates the difference between invertible linear relationships (top) vs. the ICA non-invertible relationship after one or more ICs (range) are removed (bottom).
Figure 2: Illustration of the algebra of “invertible” functions (top) vs “non-invertible” functions (bottom). Top is an EEG recording where manual or semi/manual methods are used to delete artifact from a recording. The bottom is where an EEG recording that includes artifact is subjected to ICA reconstruction. Invertible functions must have a one-to-one correspondence between the domain and range of the function. Failure of a one-to-one mapping is a “non-invertible” function in which there is no inverse function. The mathematics of Poisson, Laplace, Green, Maxwell and Helmholtz are all invertible and non-invertible functions such as ICA reconstruction introduce error in the computations of electricity.
At first glance ICA reconstruction sounds promising but has a serious limitation because the replacement ICA-R time series has altered all of the amplitude and phase differences of the original time series, including the artifact free segments and there is no inverse transform that maps the original time series to the lower number of ICs. For example, Montefusco-Siegmund R, et al. [23] concluded that “in this work we showed that ICA artifact correction would introduce non-linear and non-stationary phase changes over time and across frequencies, setting spurious phase coherence indexes.” Therefore, the assumptions that underlay ICA fail to fit the physiological reality of the human EEG. Because of the violation of the physiological reality of the EEG, Hyvainen A, et al [32] stated: “It should be noted that this technique [ICA] may lead to the insertion of undesirable new artifacts into the brain recordings (Jervis BW, et al. 1989)” [33]. Castellanos NP and Makarov VA [26] concluded: “a separation is valid for independent, linearly mixed sources when their total number does not exceed the number of recording electrodes (Bell AJ and Sejnowski TJ [34]). In practice these assumptions can be violated leading to a “leak” of the cerebral activity into components deemed artificial. Complete rejection of such a component supposes a partial loss of the neural signal.”
Some justify the use of ICA reconstruction by wrongly believing that all EEG recordings contain artifact and therefore there is no ground truth of the EEG. This justification for the use of ICA reconstruction is invalid as attested to by the fact that there are over 170,000 peer reviewed EEG studies cited in the National Library of Medicine database (Pubmed, 2020) with high consistency and high test retest reliability and high cross-validation to various clinical conditions because the artifact free segments of the EEG without the use of ICA were used in these studies. A search of the National Library of Medicine database (Pubmed) using the search terms: “ICA rejection of EEG artifact” yields only 52 citations. If all EEG was contaminated by artifact as the supporters of ICA replacement claim then this literature would not exist nor have high reliability and validity of the EEG [4- 14,35,17,27]. To appreciate the difference of the process, consider this situation: if a recording of 3 minutes contains only one blink artefact of 1 second the ICA-R method will apply (and change) the complete recording (3 minutes) including the artefact free segments in contrast with the traditional artefact removal techniques where only the 1 second artefact containing segment is removed leaving the rest of the original recording unaltered. ICA-R is a very incisive technique that replaces any original signal by a digital reconstruction.
The goal of this study is to provide pre vs. post empirical evidence that Independent Component Analysis (ICA) to reconstruct a substitute time series that visually appears to remove artifact but actually creates artifact and distorts phase differences present in the artifact free parts of the original time series. To counter the argument that all EEG recordings contain artifact we will show that phase distortion occurs for each and every time sample for all combinations of electrodes including the artifact free portions of the EEG. Also, we cite the scientific literature demonstrating high test re-test reliability of EEG as well as high clinical effect sizes which would not exist if EEG was always contaminated by artifact [7,35,27].
Methods
Subjects and EEG recordings
A total of 11 non-epileptic clinical subjects ranging in age from 18 to 33 years (8 males) were included in this study. The EEG was recorded using a Mitsar amplifier model 202 at a sample rate of 250 Hz in the eyes open condition. Three to seven minutes of EEG of 19 channels of EEG was recorded according to the 10/20 electrode locations referenced to linked ears.
Manual artifact free selections vs. ICA reconstruction
Visual inspection and manual deletion of eye movement and blink and any artifact was conducted and the artifact free original EEG was saved in an EDF formatted file. In addition, WinEEG software (Version 2.11) was used to compute blind source separation using Independent Component Analysis (ICA) based on the Infomax algorithm [36,37]. After the decomposition of the 19 channel EEG the component related to eye movement artifacts were manually selected and after removing the eye movement related Independent Component (IC) then based on the remaining 18 Independent Components (ICs) a replacement 19 channel time series was computed. As mentioned previously, this is referred to as ICA replacement or reconstruction and the new replacement 19 channel time series is referred to as the ICA-R time series. The ICA-R data were also saved in EDF format. To verify that the WinEEG ICA-R was computed correctly the original EEG and the WinEEG ICA-R data from a single subject (subject #11 duration of 6 minutes & 51 seconds) was sent to experts at the Univ. of California at San Diego (UCSD) and to the Eeglablist for others to independently evaluate the pre vs. post ICA data. Individuals on the EEG Lab list used standard software [37,38] to perform ICA-R where two independent components representing eye movement and blink artifact were deleted and the remaining 17 Independent Components (ICs) were used to reconstruct a replacement 19 channel time series (ICA-R). This provided a further independent validation of phase difference distortion and also compared the WinEEG ICA-R to the UCSD ICA-R using the same original EEG data.
Spectral analyses
The Hilbert transform was used to compute the “instantaneous” auto and cross-spectrum for each time sample and for all combinations of electrodes for the hand edited original EEG data as well as the ICA-R, time series for each subject. Instantaneous coherence and phase differences were calculated for all channel combinations for four different frequency bands (delta 1-3.99 Hz; theta 4-7.99 Hz; alpha 8-12.99 Hz; beta 13-30 Hz). The time points selected by the hand editing method were saved and also used for an FFT analysis of the ICA-R data. In this way the exact same time samples were used for both the original 19 channel time series and the 19 channel ICA-R time series. Statistical analysis involved first comparing the percent difference between the original hand edited EEG and the ICA-R time series and then paired t-tests were computed between the original hand edited EEG and the ICA-R time series.
Calculation of phase differences
Phase is the position of a point in time (an instant) on a waveform cycle. A complete cycle is defined as the interval required for the waveform to return to its arbitrary initial value. Phase difference is the difference, expressed in radians, degrees or time, between two waves having the same frequency and referenced to the same point in time. Two oscillators that have the same frequency and different phases have a phase difference and the oscillators are said to be out of phase with each other. The amount by which such oscillators are out of phase with each other can be expressed in degrees from 0° to 360°, or in radians from 0 to 2π. If the phase difference is 180 degrees (π radians), then the two oscillators are said to be in antiphase.
Phase shift is any change that occurs in the phase of one quantity, or in the phase difference between two or more quantities. The symbol is sometimes referred to as a phase shift or phase offset because it represents a “shift” from zero phase difference. For infinitely long sinusoids, a change in is the same as a shift in time, such as a time delay. If x(t) is delayed (time-shifted) by 1/4 of its cycle, it becomes:
Where “phase” is now ϕ − π/ 2 and has been shifted by radians (the variable A° represents the amplitude of the wave).
Figure 3 is an illustration of phase difference:
Figure 3: Illustration of a sine wave on the left and phase difference on the right.
A common method to compute phase differences is by first computing the power spectra for x and y (i.e., two different EEG channels) and then the normalized cross-spectra. Since complex analyses are involved this produces the cospectrum (‘r’ for real) and quadspectrum (‘q’ for imaginary). The phase difference between x and y is computed as: phase=159.1549 tan–1 (q/r)/f where f is the center frequency of the Hilbert transform. In complex numbers coherence is the normalized vector length Coh =√r2+q2 phase Ξxy =α is the angle represented by the arc tangent of q/r and SC is the center frequency.
Phase difference is represented by radians or degrees but also in milliseconds when the tangent is multiplied by 159.1549 or phase difference=159.159tan-1(q/r)/SC [7,12,19,21]. The average phased difference for a given electrode spacing and frequency is often computed using the FFT. Mathematically the cross-spectrum between a pair of channels at one frequency is defined as the product of the signal from one channel with the complex conjugate of the signal of the other channel. This produces a two-dimensional vector on the complex plane where the orientation of the vector is equal to the difference of orientations of the phases of the two signal vectors is the measure of EEG phase difference between the two signals.
The Hilbert transform is a Joint-Time-Frequency-Analysis (JTFA) that is used to compute phase differences for a given frequency for each and every time sample and is called instantaneous phase differences. The analytical signal associated with the time series x1(t)isZ1(t)=a1(f)ejφ1(t) where the imaginary part y1(t)=Im{Z1(t) is the Hilbert transform of the signal x1(t) The signal is bandpass filtered around the frequency of interest and then the instantaneous phase is estimated from the analytical signal as: Φx1(t)=arg{x1(t)+j1(t) } The phase difference between two EEG channels x1(t) and x2(t) is φ1, x2(t)=mφx1(t)−nφx2(t) One may compute the 1st derivative of phase differences and if the phase differences are constant over time then this is called “Phase Locking”. Phase lock duration is the period of time between phase shifts or the period of time when the 1st derivative approximates 0 that begins and ends when the 1st derivative is a large absolute value greater than some criteria value called a phase shift [14-16,39,40].
Another expression of phase difference is by Bendat JS and Piersol AG [35] where phase difference is a function of frequency, distance and conduction velocity: i.e.,
Where ΞXY is the phase lead or lag between locations x and y, f is the frequency, d is the distance traveled in meters and c is the conduction velocity in m/sec [35].
As mentioned previously in the brain the physiological ground truth of EEG phase or time differences are based on the fact that neurons are connected in loops and between network hubs with accumulated delays both within the neocortex and between cortical and subcortical structures. The ground truth of EEG phase differences between separated scalp locations are physiological factors of the physics of electrical sources and the connections between sources (i.e., networks) such as axonal conduction velocities, length of connecting axons, synaptic rise times, synaptic delays and synaptic integration times, etc. [4,7,18-20].
Results
Evidence that ICA reconstruction and EEG replacement alters the phase differences of the EEG
The ICA-R reconstruction performed by the UCSD group for one subject were very similar to those performed by the WinEEG software and showed essentially the same differences although there were fewer significant differences using the USCD ICA. Because of space limitations, the UCSD ICA-R method [37] which is a standard and widely used ICA method will be presented as a representative example of the same differences between the original EEG and the ICA-R time series in all of the subjects.
The Hilbert transform confirmed that the phase differences had been altered for each and every time sample and for all electrode combinations in the artifact free sections of the EEG recording. Figure 4 is an example the absolute phase differences in degrees between the O1 channel and the other 18 channels using the Hilbert transform. The left panel is the phase differences in the original recording and the right panel are the phase differences after ICA reconstruction. Visual inspection shows that phase differences are not the same between the left and right panels. The vertical line at 12 seconds and 400 milliseconds from the beginning marks the phase difference at an instant of time between the original EEG recording and the ICA reconstructed time series. The panel of numbers to the right of the tracings is the exact phase differences in degrees for all channel combinations at this instant of time. There was no artifact in this part of the recording and one can demonstrate that phase differences have been altered for each and every time point and all channel combinations and all frequency bands for the entire record. Figures 5 and 6 are additional examples of phase distortion in the artifact free parts of the recording.
Figure 4: The columns on the left are EEG channel labels and the columns on the right are absolute phase differences in degrees for the four frequency bands. The “Original (Non-ICA)” image are absolute phase differences in a patient with attention deficit disorder with respect to O1 and the other 18 channels in the original EEG recording and the right “ICA Reconstruction-(UCSD EEG Lab)” image are the phase differences after the ICA reconstruction for the exact same time points and channel combinations. The box to the right of the phase difference tracings are the phase difference values (degrees) at exactly 12 seconds and 400 milliseconds from the start of the recording. This part of the EEG recording was artifact free. A careful comparison demonstrates that the phase differences have been altered in each and every artifact free time sample for the entire EEG record and for all frequencies and all channel combinations.
Figure 5 is another example of differences in the absolute phase differences in degrees between the T5 channel and the other 18 channels using the Hilbert transform at a different instance of time in the recording where again there was no artifact (1 minute and 21 seconds after the start of the recording). The left panel is the phase differences in the original recording and the right panel are the phase differences after ICA reconstruction. Visual inspection shows that phase differences are not the same between the left and right panels. The panel of numbers to the right of the tracings is the exact phase differences in degrees for all channel combinations at this instant of time.
Figure 5: The columns on the left are EEG channel labels and the columns on the right are absolute phase differences in degrees for the four frequency bands. The “Original (Non-ICA)” image are absolute phase differences in a patient with attention deficit disorder with respect to T5 and the other 18 channels in the original EEG recording and the right “ICA Reconstruction-(UCSD EEG Lab)” image are the phase differences after the ICA reconstruction for the exact same time points and channel combinations. The image on the right are the phase differences after the ICA reconstruction for the exact same time points and channel combinations. The box to the right of the phase difference tracings are the phase difference values (degrees) at exactly 1 minute and 21 seconds from the start of the recording. This part of the EEG recording was artifact free. A careful comparison demonstrates that the phase differences have been altered in each and every artifact free time sample for the entire EEG record and for all frequencies and all channel combinations.
Another example of alterations in instantaneous phase differences in artifact free parts of the EEG recording is shown in figure 6. The absolute phase differences in degrees are between the P4 channel and the other 18 channels using the Hilbert transform at a different instance of time in the recording where again there was no artifact (3 minutes and 4 seconds after the start of the recording). The left panel is the phase differences in the original recording and the right panel are the phase differences after ICA reconstruction. Visual inspection shows that phase differences are not the same between the left and right panels. The panel of numbers to the right of the tracings is the exact phase differences in degrees for all channel combinations at this instant of time.
Figure 6: The columns on the left are EEG channel labels and the columns on the right are absolute phase differences in degrees for the four frequency bands. The “Original (Non-ICA)” image are absolute phase differences in a patient with attention deficit disorder with respect to P4 and the other 18 channels in the original EEG recording and the right “ICA Reconstruction-(UCSD EEG Lab)” image are the phase differences after the ICA reconstruction for the exact same time points and channel combinations. The image on the right are the phase differences after the ICA reconstruction for the exact same time points and channel combinations. The box to the right of the phase difference tracings are the phase difference values (degrees) at exactly 3 minutes and 4 seconds from the start of the recording. This part of the EEG recording was artifact free. A careful comparison demonstrates that the phase differences have been altered in each and every artifact free time sample for the entire EEG record and for all frequencies and all channel combinations.
Alteration of Phase Differences by ICA Replacement (ICA-R) Using the FFT
EEG analyses of brain networks requires using measures such as phase delays, coherence, lagged coherence, phase slope index, phase lock, phase shift, phase-amplitude coupling, cross-frequency coupling and directed coherence to name a few commonly used methods. The scientific literature on this range of network measures is large and beyond the scope of this paper to review. However, all of these measures of network dynamics depend on accurate and reliable measures of the electrical phase differences in the original recording and if phase differences between channels in the original EEG recording are irreversibly altered using an invalid method then all subsequent network analyses are invalid.
Statistically significant differences between the phase differences in the original EEG recording vs. post ICA-R were present in all eleven subjects. Table 1 show that the percentage of statistically significant comparisons (P < 0.05) for 171 pairs of channels ranged from 13.5% to 98.2%.
T-TESTs |
Overall |
Delta |
Theta |
Alpha |
Beta |
Subject 1 |
54.8% |
39.8% |
67.8% |
55.0% |
56.7% |
Subject 2 |
95.5% |
97.7% |
98.2% |
87.7% |
98.2% |
Subject 3 |
34.5% |
21.6% |
42.1% |
20.5% |
53.8% |
Subject 4 |
19.2% |
18.1% |
13.5% |
20.5% |
24.6% |
Subject 5 |
25.7% |
34.5% |
23.4% |
19.3% |
25.7% |
Subject 6 |
95.8% |
96.5% |
95.3% |
94.7% |
96.5% |
Subject 7 |
45.3% |
28.1% |
62.6% |
33.3% |
57.3% |
Subject 8 |
64.6% |
53.8% |
76.0% |
60.8% |
67.8% |
Subject 9 |
33.9% |
21.6% |
33.9% |
14.0% |
66.1% |
Subject 10 |
75.9% |
80.7% |
80.1% |
78.4% |
64.3% |
Subject 11 |
52.9% |
54.4% |
59.6% |
41.5% |
56.1% |
% Average |
54.4% |
49.7% |
59.3% |
47.8% |
60.7% |
Table 1: Percentage of statistically significant (P<0.05) ICA-R changes in phase difference between the original EEG and the ICA-R replacement of the
original EEG. Subject 11 is the UCSD Eeglablist subject.
Figure 7 is an example of the percent difference in phase differences between channels for the original pre-ICA record vs. the Eeglablist (UCSD) post ICA-reconstructed time series (ICA-R time series).
Figure 7: The rows are channel pairs for the left and right hemisphere. The columns are percent difference of the phase values in the delta, theta, alpha and beta frequency bands. FFT Percent difference between 1 min & 42 seconds of artifact free EEG in the original time series vs the ICA reconstructed time series using the same time points of 1 min & 42 sec. The smallest difference was 0.59% for Fp1-Fp2 in the alpha band and the largest difference was 99.8% for P3-P4 in the alpha band.
Figure 8 are the results of paired t-tests between the same 1 min & 42 sec of artifact free parts of the a patient’s original EEG phase differences vs. the same time points in post ICA-R’ reconstructed phase differences conducted by the UCSD Eeglablist method where 87.5% were statistically significant (P<0.05).
Figure 8: The rows are channel pairs for the left and right hemisphere. The columns are paired t-test results for the delta, theta, alpha and beta frequency bands between 1 min & 42 seconds of artifact free EEG in the original time series vs the ICA reconstructed time series using the same time points of 1 min & 42 sec. A total of 256 t-tests were conducted and 96.5% were statistically significant.
The same large changes in other network metrics have been evaluated for numerous subjects. Because of page limitations it is not possible to present all of these analyses. Suffice it to say that all of the network analyses demonstrated similar magnitude of difference between the original artifact free parts of a recording and the post ICA replacement time series. A You Tube video demonstration of the alteration of phase differences by ICA-R replacement is at: http:// youtu.be/BfqCh2UeJik
Discussion
The results of this study support the conclusions of numerous distinguished scientists and the Montefusco-Siegmund R, et al. [23] simulation analyses that: “in this work we showed that ICA artifact correction would introduce non-linear and non-stationary phase changes over time and across frequencies, setting spurious phase coherence indexes.” The findings in this study also support the cautionary words of Mannan MMN, et al. [24]: “Although the performance of ICA is promising, it should be employed with care [25]. Also Castellanos NP and Mkarov VA [26] stated that “the artifact suppression may also corrupt the power spectrum of the underlying neural activity.” and that “Rejection of such components supposes a loss of a part of the cerebral activity and, consequently, distortion of the artifact free EEG.
Hyvarinen A, et al. [32] stated: “In noisy ICA, we also encounter a new problem: estimation of the noise-free realizations of the independent components (ICs). The noisy model is not invertible” [32], Hyvarinen A, et al. [32] further state: “A difficult problem in independent component analysis (ICA) is encountered if the number of mixtures xi is smaller than the number of independent components. This means that the mixing system is not invertible” [32]. “Thus, the mixing matrix has size m x n with n>m, and therefore it is not invertible.” [32]. Wallstrom G, et al. [42] concluded: “We were disappointed by the performance of ICA for artifact correction, both in our simulation study and in investigations with real data.” and that: “studies are needed to further investigate the potential spectral distortions induced by ICA”.
Finally, the results of the present study are consistent with statements in 2014-2017 by experts on ICA at the University of California at San Diego and elsewhere:
• “If you remove IC and reconstruct channel EEG by back projecting the remaining ICs, of course it changes channel EEG phase!” (Makoto Miyakoshi, Eeglablist ICA and signal phase content, Sept. 16, 2014)
• “The EEG reconstruction after removing bad components/sources MAY change the phase value of the signal at any electrode.” (M. Rezazadeh Eeglablist ICA and signal phase content, Sept. 18, 2014).
• “The reconstructed data after removing spurious ICA components differs from the original time series, and because of that there are phase differences.” (Arnaud Delorme, Eeglablist ICA misinformation, June 10, 2017).
• “I first noticed the problem with phase distortion more than a decade ago” (Robert Lawson, Eeglablist ICA misinformation, June 14, 2017).
• “I think he is right that the relative phase will be changed by deleting 1 or 2 artifact components.” (Ramesh Srinivasan, Eeglablist ICA misinformation, June 14, 2017).
• “We found phase distortions in the 8-10 Hz alfa band (greatest near the source of artefact) but also on more remote electrodes such as occipital and also in artefact free strokes of EEG.” (Georges Otte, Eeglablist ICA misinformation, June 15, 2017).
In some situations phase differences are irrelevant, for example, when measuring AM radio waves. In other situations phase differences are critical, for example, TV signals. In evoked potential analyses phase differences are not considered important because of the assumption that all of EEG is “volume conduction” that is defined as phase difference equals zero. However, in EEG network analyses phase differences are critical, for example, coupling strength or the direction and magnitude of information flow or conduction velocities, etc. ICA replacement is like an IIR filter that alters both amplitude and distorts phase. If one uses a FIR filter or a zero-phase-shift digital filter then there is no phase distortion. Therefore, the answer to the question “does filtering make things worse?” is YES especially when each and every time sample is distorted in an entire EEG recording that does not contain artifact and the transform is non-invertible.
There are two main categories of artifact correction of the EEG:
1- Deletion of the parts of the recording that contain artifact without any alteration of the artifact free portions of the recording and,
2- Mathematical decomposition and then replacement of a non-invertible time series that has altered all of the values of the original time series including the artifact free parts of the recording. The recording reference is important in the first category but is irrelevant in the second category because the alteration of the time samples in the original recording decouples the subject’s brain from the underlying ground truth physics of the EEG. ICA replacement replaces the original time series with artifact and once alteration of data at the primary level occurs then all subsequent network analyses, including re-referencing are invalid. While amplitude alteration by ICA replacement may have little impact on event related potentials the situation with spontaneous EEG is more serious. More significant is the lack of justification for any alteration at the primary level of data in the first place.
Over 120,000 peer reviewed quantitative EEG studies have been published since the late 1950s that use standard artifact deletion methods without any alteration in the artifact free parts of the EEG recording (see National Library of Medicine Pubmed search using EEG as the search term). Mathematical decomposition of the EEG without replacement of an altered time series was introduced in the 1960s [43,44]. For example, principal components analysis decomposes EEG features that are correlated and varimax rotation to separate loadings on orthogonal axes. Independent Component Analysis (ICA) was introduced in the 1990s [42,34] and differs from principal components analysis by decomposing a multivariate signal into independent nonGaussian components. ICA is a special case of blind source separation such as in the “cocktail party problem” of listening in on one person’s speech in a noisy room. ICA is a very useful decomposition method but lacks scientific validity when it comes to removing ICs and the replacement of the original EEG by an “artifact” that separates measurement from the ground truth of the electrogenesis of the EEG and the invertible physics of electrostatics described by Gauss, Laplace, Poisson, Green, Maxwell and von Helmholtz. A search of the National Library of Medicine (Pubmed) using the search terms “EEG test retest reliability” produces over 3,000 citations since the 1970s with quantitative EEG. To the best of our knowledge none of these citations used ICA replacement and all of them used the artifact deletion method with P>0.9 test re-test reliabilities. Given this history there is no justification for the replacement of the artifact free parts of an EEG recording with artifact by ICA-R replacement when simple deletion of artifact by deletion is the historic standard.
Unfortunately, there are a growing number of EEG connectivity publications where phase differences and/or coherence were calculated after ICA-R replacement. The problem is a lack of validity because the non-invertible alteration of phase differences are not replicable and the results are not accurate measures of brain connectivity because the original physiologically dependent phase differences have been arbitrarily and irreversibly altered. We have found twenty exemplar publications that computed scalp EEG phase differences after ICA reconstruction/replacement and we are listing them here as a word of caution to note that the findings in these studies lack validity and reliability [45-65].
We recommend that the authors of these studies re-visit their raw data using standard artifact free selection methods and not ICA-R (a replacement of the artifact free parts of the original EEG) and then recalculate the EEG connectivity measures based on the invertible physics selections without any phase distortion. Then compare their results to the ICA-R dependent results. We also recommend that a standard for reviewers of EEG studies that used ICA-R to ask the authors to compare their results with and without the use of ICA-R replacement and if there are differences then to ask the authors to explain the validity and reliability of the differences. Finally, for legal and ethical purposes it is important to not replace a patient’s artifact free parts of their EEG medical record with a distorted digital time series without the patient’s knowledge and consent.
Invalidation of the physics of the EEG
The reason that ICA invalidates the physics of electrostatics is because it replaces the original electrical potential measures with noninvertible substitutes of the original electrical potentials and distorts phase differences. For example, a charge q that moves a distance dx where the differences in electrical potential between the initial and final location is dV, then the change in potential d =q dV and the work W used to move the charge is W=-qE dx cos θ where E is the local electrical field strength and θ is the angle between the direction of the field and the x-axis [48]. If one replaces the original electrical potential in volts with an irretrievably altered value (ICA-R), then the change in the charge q and electrical potential is irretrievably invalid where one uses 1 or 18 ICs to create a non-invertible 19 channel alternate substitute reality or dP ≠ dP which is the case with ICA-R. These conclusions were tested in the tables and figures of this study and are consistent with the publications, simulations, mathematics and experts cited previously
Invalid justifications for ICA-R replacement of the original EEG recording
An important issue is the justification for the ICA-R distortion by a non-invertible ICA replacement of the original electrical physics of the human EEG with a mathematical construct that is physiologically and mathematically invalid. There are two frequently used invalid justifications:
1- EEG is all noise to begin with and therefore an alternate ICA universe is of equivalent value, or
2- There is no physiological ground truth of EEG and phase differences in the first place and hence no harm in replacing the original EEG recording. There are discussions about the concept of “non-stationarity” and high density 128 channels vs. 19 channels and the relative magnitude of “corruption” but these are academic without recognition that there must be zero tolerance of distortion at the primary level of recording of the physics of volts and time no matter how many recording sensors were used.
Acknowledgements
We want to thank Arnaud Delorme and others on the UCSD Eeglablist for their comments and dialog and careful ICA analyses of the Mitsar EEG data sample. This study was support by USDA grant (CRS 801-12-09C) and NIH (RR 08079 09). This study was exempt from the requirement of review by an ethics committee because only post hoc statistical analyses on pre-existing data stored in archives from patients who signed a consent form was used and all patient identifiers have been removed.
Declaration of Conflicting Interests
The Author(s) declare that there is no conflict of interest.