1. Introduction
Multiplexing methods providing multiple parallel data channels over
Multimode Fiber (MMF) or waveguides have long been subject to active research, based on wavelength, subcarrier, or angular/modal diversities. The advent of
Multiple Input Multiple Output (MIMO) techniques in wireless transmission has inspired porting MIMO to guided multimode optical media [
1–7
H.R. Stuart, “Dispersive Multiplexing in Multimode Optical Fiber,” Science
289, 281–283 (2000). [CrossRef] [PubMed]
]. It is noted that all MMF MIMO studies to date have addressed
Point-To-Point (PTP) links with
Informed Receivers (IR) estimating the channel for receiver-side MIMO processing, while the transmitter is uninformed of it. Moreover, in most works to date, the multiple input ports are driven by independent
mutually incoherent optical sources, in effect operating under optical power superposition (see [
7
R. C. J. Hsu, A. Tarighat, A. Shah, A. H. Sayed, and B. Jalali, “Capacity enhancement in coherent optical MIMO (COMIMO) multimode fiber links,” IEEE Comm. Lett.
10, 195–197 (2006). [CrossRef]
] for an exception, treating coherent optical detection MIMO, albeit of prohibitive complexity for short-range optical interconnects).
This paper extends previous MMF MIMO studies in three respects: (i): We explore, for the first time in optical MIMO, the consequences of an Informed Transmitter (IT) strategy, providing feedback paths from the (possibly distributed) Receivers (RX) to the Transmitter (TX), over which the RXs convey their estimates of the Channel State Information (CSI) to the TX (CSIT). (ii): We introduce a broadcast MMF MIMO system (with PTP links as a special case) over short-range (<100m) optical interconnects (iii): Our modulated optical input ports are mutually coherent but Direct-Detection (DD) is used at the output ports.
2. Optical zero-forcing beamforming system overview
We propose a novel optical
Zero-Forcing BeamForming (ZFBF) technique, providing multiple decoupled virtual sub-channels to a large number of independent receivers at multiple optical taps or drops. Such broadcast capability, not available with the IR strategy, is potentially applicable to short-range photonic interconnect structures over multimode waveguides and MMF, in topologies such as buses and stars, i.e. short-reach “mini-PON” distribution networks, wherein each of the receive stations listens to its own targeted data tributary. Our method provides an optical equivalent to ZFBF techniques recently emerging in wireless communication for MIMO broadcast [
8
Q. H. Spencer, A. L. Swindlehurst, and M. Haardt, “Zero-forcing methods for downlink spatial multiplexing in multiuser MIMO channels,” IEEE Trans. Sig.Proc.
52, 461–471 (2004). [CrossRef]
,
9
Y. Taesang and A. Goldsmith, “On the optimality of multiantenna broadcast scheduling using zero-forcing beamforming,” IEEE J. Sel. Areas Commun.
24, 528–541 (2006). [CrossRef]
]. However, unique to the optical predicament, our approach resolves the complication of field interference cross-terms resulting from quadratic detection of mutually coherent modulated optical sources.
Our technique may also be viewed as an extension of the system in [
10
J. Kahn, “Compensating Multimode Fiber Dispersion Using Adaptive Optics,” in OFC ′07 Anaheim, Paper OTuL1 (2007).
], which made use of a
Spatial Light Modulator (SLM) to transmit 10 Gb/s over a 11 Km MMF. As in [
10
J. Kahn, “Compensating Multimode Fiber Dispersion Using Adaptive Optics,” in OFC ′07 Anaheim, Paper OTuL1 (2007).
], we assume a feedback path is available from RX(s) to the TX, which is also equipped with a
Spatial Light Modulator (SLM). However, [
10
J. Kahn, “Compensating Multimode Fiber Dispersion Using Adaptive Optics,” in OFC ′07 Anaheim, Paper OTuL1 (2007).
] is essentially a Multiple Input Single Output (MISO) system making use of spatial diversity to mitigate ISI at a single output port, whereas our MIMO system feeds multiple ports, translating spatial diversity into multiplexing gain.
From an implementation point of view, an alternative to the SLM device required in the TX may be provided by leveraging recent silicon photonics progress to realize an equivalent
MultiPort Modulator (MPM) as a parallel bank of modulators, each of the type described in [
11], all fed by a single optical source (DFB or VCSEL) transmitting mutually coherent, independently modulated multiple optical signals coupled into the input facet of the MMF. The added ZFBF TX complexity enables the simplest possible uncoordinated OOK receivers.
Optical ZFBF concept
: The SLM/MPM shapes the transmitted complex amplitude wavefront with a certain spatial resolution into field patterns referred to as
PseudoModes (PM), each lighting up its own unique detector, while eliciting zero response in other photodetectors (
Fig. 1). These beam-forming PM vectors are transmitted in superposition, each modulated by its own data tributary. As an application we propose a system transmitting 112 Gb/s over a
single MMF link or short-reach PON (
Fig. 2), realized as a massively parallel ZFBF MIMO system, mapping 32 data tributaries, each at 3.5 Gb/s, into 64 optical inputs, leading to 32 RXs, with no cross-talk between one another. In contrast, most MMF MIMO systems to date were 2×2; Exceptions are [
4
Y. Yadin and M. Orenstein, “Parallel optical interconnects over multimode waveguide,” J.Lightwave Technol.
24, 380–386 (2006). [CrossRef]
,
5
M. Greenberg, M. Nazarathy, and M. Orenstein, “Data Parallelization by Optical MIMO Transmission over Multi-Mode Fiber with Inter-Modal Coupling,” in LEOS ′06 Montreal Canada Paper WX 5, (2006).
] using a large number of mutually incoherent optical sources.
Fig. 1. (Color online) ZFBF concept: Each PM A̰
(r) , r = 1,2,…,nU
is focusing light onto the r-th detector.
The range limit of our method is set by intermodal dispersion, estimated to be limited to up to 100 meters at 3.5 Gb/s with modest or no equalization, for many types of Polymer Optical Fibers. As per [
12
P. Pepeljukovski et al, “Data Center and High Performance Computing Interconnects for 100 Gb/s and Beyond,” in OFC ′07 paper OMR4, (2007).
] most data-center applications require <100m distances.
3. Quadratic MIMO channel model
We start by modeling the Quadratic MIMO (Q-MIMO) channel with Multi-Mode Detectors (MMD) for the case of mutually coherent input ports, and direct detection.
Let H denote the MMF random channel matrix (CM), describing the linear transformation of optical field from input to output, with D
0, the number of (speckle / modal) Degrees of Freedom (DOF) at each detector. The complex modal fields vector incident at the nR
detectors inputs, is Ḛd =HḚ
s
where the transmitted field vector Ḛ
s
may be arbitrarily shaped in amplitude and phase by means of an SLM or MPM. It is noted that the elements of the input and output field vectors Ḛ
d
,Ḛ
s
are spatially-integrated and temporally-sampled quantities, representing the continuous-valued space-time quantities in discrete form.
In a MIMO system, multiple extended area detectors, called MultiMode Detectors (MMD) are coupled to the MMF end-facet, either directly attached to it or via intermediary short MMF leads. The origin of the MMD term is due to the fact that each such detector captures multiple field Degrees Of Freedom (DOF) over its aperture, be it speckle elements or orthogonal spatial transverse modes incident on the detector surface.
For simplicity we assume a uniform DOF number D
0 per MMD.
The received field column vector Ḛ
d
is partitioned into vector blocks Ḛ
d(r) , with the r-th vector containing the D
0 field DOFs captured by the r-th detector:
Ḛ
d
= [Ḛ
d(1)T
,…,Ḛ
d(r)T
,…,Ḛ
d(nR
)T
]
T
. Similarly, the CM H of size nRD
0×nT
is partitioned into nR
matrix blocks, H
(r) , called partial CMs, such that Ḛ
d(r) = H
(r)
Ḛ
s
, i.e. the r-th partial CM describes the linear transformation of field from the input vector to the field DOFs vector captured by the r-th MMD. The photocurrent generated by the r-th MMD is
where for simplicity we assumed unity responsivity, equating total power with photocurrent, and in the first equality we used the fact that the total power received by the r-th MMD equals the sum of powers of its individual field DOFs, a consequence of the spatial orthogonality of either the speckle elements or the modes in the fiber lead of each MMD, as applicable. Each noiseless photocurrent is then expressed as a quadratic form, where G
(r) = H
(r)†
H
(r) is called the r-th Quadratic Channel Matrix(QCM). We further introduce MISO responsivities :
The detected photocurrents Id
r
are corrupted by zero-mean additive gaussian noises Nr
~ N[0, σ2
I
], independent of each other and of the transmitted signal: Ir
= Id
r
+ Nr
.
Channel estimation procedure: The QCM is determined by a channel estimation procedure run once per coherence interval (CI), based on transmitting training sequences consisting of unit vectors, then two-element vectors with their components in-phase and in-quadrature:
Launching Ḛ
s
= [δ
p
], i.e. exciting just the p-th input port, p = 1,…,nT
, yields all the QCM diagonal elements: Id
r
(Ḛ
s
) = G
(r)
pp
. Once the diagonal elements are estimated, the off-diagonal elements are derived by exciting all possible nT
(nT
- 1) / 2 distinct pairs of input ports in turn, launching: Ḛ
s
= [δ
p
+ δ
t
], yielding Re[G
(r)
pt
] = [Id
r
(Ḛ
s
) - G
(r)
pp
- G
(r)
tt
] /2 followed by launching all possible pairs of signals in quadrature, Ḛ
s
= [δ
p
+ jδ
t
], yielding Im[G
(r)
pt
] = j[G
(r)
pp
+G
(r)
tt
-Ir
d
(Ḛ
s
)]/2. This completes the off-diagonal elements evaluation. Each training sequence is repeated N times per CI, and the responses are averaged, e.g. N=1000 repetitions provide ~30 dB noise reduction, incurring an overhead of the order of ~5% in multiGb/s scenarios, as the channel varies relatively slowly, CI≈ 25 msec. The channel estimation overhead is not excessive relative to the substantial increases in data-rate (up to ×32 i.e. thousands of %) attained with ZFBF MIMO relative to a SISO channel.
4. Optical Zero-Forcing-Beam-Forming
In this section we construct, a transmission alphabet of nU
PMs, A̰(r),r = 1,…,nU
, to be loaded into the precoder, once per CI, satisfying the ZF Constraint (ZFC):
Physically, the r-th PM, A̰(r), is an input vector lighting up the r-th detector while eliciting zero response in all other Active Detectors (AD) (forming a subset of the full detector set, selected once per each CI, the optimal selection of which is outside the scope). The currents of the non-active detectors are ignored. Here nU
is the “number of users” - the size of a subset of AD over which the ZFBF signaling is conducted in each CI, with the indexes r,r′ running over the ADs. The ADs are re-indexed from 1 to nU
, without loss of generality. PMs can be transmitted in superposition rather than one at a time. The total transmitted field is
where A is a precoder matrix collecting the PMs in its columns, is a scaling coefficient, and s = [s
1,…,sr
,…snU
]
T
is the vector of transmitted info symbols. The photocurrent in the r-th MMD due to the transmitted field (4) is seen to be solely dependent on the r-th transmitted info symbol, Sr
indicating that the transmission has been essentially decoupled into nU
effective channels, referred to as pseudochannels:
Taking the info symbols Sr
to be i.i.d. with zero-mean and unity variance,⟨|Sr
|2⟩=1, the average transmitted power is the sum of PM powers
where ρ
eff
is the harmonic mean of the MISO responsivities over all pseudochannels, and ∥A̰(r)∥ = ρ-1
r
(from (3),(2)). Solving (6) for E
0, under a fixed transmit power PT
, yields E
2
0 = PTρeff
. Let us now define the optical SNR at the r-th detector as γr
=⟨Id
r
⟩/σ
I
(its square is the electrical SNR). The optical SNR is then seen to be uniform over all MMDs
where γU
=PT
/(nUσI
) is the transmit SNR per user.
Optimizing the uniform SNR under an averaged transmit power constraint is equivalent to maximizing ρ
eff
. This optimization decouples into nU
independent maximization problems over each of the ρr
s.t. the ZFC (3).
Let H̅(r) be the r-th interference submatrix obtained by collecting the partial CMs of all the ADs, except the r-th one: H̅(r) =[H
(1)T
,…,H
(r-1)T
,H
(r+1)T
,…,H
(nU)T]
T
. The ZFC (3) then implies that the r-th PM belongs to the right nullspace of this matrix:
An orthonormal basis of this nullspace is arrayed in the columns of a null interference basis matrix
N̅(r). The ZFC (8) is then equivalent to ∃a̰(r): A̰(r) = N̅(r)
a̅(r). Substituting this expression into (2), the r-th MISO responsivity may be recast as:
where g
(r) = N̅(r)†G
(r)
N̅(r) = (H
(r)
N̅(r))†(H
(r)
N̅(r)), and a̰(r) is unconstrained.
The last expression in (9) identified as a Rayleigh Quotient (RQ) [
13
B. Noble and J.W. Daniel, Applied Linear Algebra (Prentice-Hall, 1988).
] known to be optimized by selecting
a̰
(r) as the dominant eigenvector of
g
(r) which coincides with the dominant right singular vector of
H
(r)
N̅
(r), corresponding to the largest
Singular Value (SV), denoted σ
r
. The maximized RQ (9) then equals the dominant squared singular value: ρ
max
r
= σ
2
r
, yielding ∥
A̰
(r)∥
2 = σ
-2
r
, i.e. the optimal PM energies are given by the inverses of the squared singular values. This concludes the construction of the optimal PMs forming the columns of the precoder matrix,
A, leading to maximum uniform SNR at all output ports,
γr
= ρ
eff
γU
, given by the “transmit SNR per user”
γU
times an effective power gain (6) equal, in the optimal case, to the harmonic mean of the squared SVs of the matrices
H
(r)
N̅
(r).
Dimensionality considerations: The ZFC formulated in (8) for the r-th detector amounts to a set of (nU
-1)D
0 homogeneous linear equations in nT
variables. Its solution space is the nullspace of H̅(r), of dimension nnull
≡ nT
- (nU
- 1)D
0, (as a random H̅(r) matrix tends to have full-row rank almost surely). For non-trivial solutions we require nT
> (nU
-1)D
0 , i.e. the number of transmit ports must exceed the number of zero-forced DOFs captured by all nU
- 1 zero-forced detectors. In our simulations we empirically found it necessary to maintain nnull
/ nT
≥ 0.25 to obtain high pseudochannel responsivity (ρmax
r
) gains.
A rudimentary transmission strategy based on these concepts excites one detector at a time, by selecting one PM per symbol, mapped by log2
nU
bits. A more advanced modulation format called here Orthogonal Vector Amplitude Modulation (OVAM), concurrently sends nU
bits per symbol, signaling over all decoupled pseudochannels, by launching a superposition of independently modulated PMs, each focused onto a different detector. Superposing multiple PMs, each driven by its independent data stream addressed to an individual RX, substantial multiplexing gain is extracted (e.g. 32× in our example), while keeping the RXs simple.
5. System performance
A short-reach (<100m) MMF PON operating at 112 Gb/s is illustrated in
Fig. 2. For simplicity we assume On-Off Keying (OOK) transmission (extension to multi-level transmission is also possible). In the receivers terminating each output port, the photocurrent is one-bit quantized with a conventional OOK CDR (Clock-Data-Recovery) front-end.
We analyze the OVAM OOK BER performance vs. transmit SNR per bit (or equivalently, per user) parameterized by nU
, the number of ADs (or equivalently the number of bits per sym). As the received SNRs γr
are uniform over r, the BER at each RX will be constant (uniform Quality of Service). With electrical AWGN, the BER is expressed as Gaussian-Q function
Fig. 2. Massively parallel transmission of 112 Gb/s over a single MMF Point-to-Multipoint ZFBF MIMO broadcast system. At the input, 32 × 3.5 Gb/s data streams are mapped into 64 optical input ports by means of the proposed ZFBF MIMO precoder. 32 active photodetectors (out of 64) are spread along the output facets of various branches. The r-th photodiode receives its own 3.5 Gb/s data stream from the r-th input, with no cross-talk from the other tributaries. The 32 uncoordinated, decoupled, conventional OOK receivers then detect an aggregate >100 Gb/s. A point-to-point >100 Gb/s link may be similarly constructed, with all 32 detectors co-located at the fiber output facet. All E-O modulators of the Multi-Port E-O Modulator or SLM are fed from a single laser.
with Q-factor Id
∆/(2σ
I
) = ⟨;Ir
d
⟩/σ
I
= γr
= γUρ
eff
where Id
∆ is the “mark” current.
For the OVAM format, carrying 1 b/sym/active-user, the transmit SNR per user γU
coincides with the transmit SNR per bit. In our simulations we drew 1000 channel matrix realizations and computed ρ
eff
(6) as the harmonic mean of the nU
squared SVs σ2
r
, in turn evaluated for each of the matrices H
(r)
N̅(r). The BER per CM realizaton is then given by (10) The total performance is given by the average , vs. γU
(TX SNR per bit) over the CM ensemble.
We simulated two models for the statistics of the CM, H :
A more idealized Orthogonal channel̲, whereby the total received optical power is a fixed fraction η of the transmitted power, drawing matrices H = √ηU with U unitary isotropic. This channel lends itself to a simple analytic description: The SVs are all σ
r
= η1/2, yielding effective responsivity gains ρ
eff
= η < 1, hence (10) yields ortho
= Q[γUη].
Remarkably, for this orthogonal channel the BER vs. SNR–per-bit is independent of the number of ADs (=number of users), as all decoupled pseudochannels attain constant loss through the CM. Hence, a linear growth (in nU
) of the aggregate bitrate at fixed BER may be provided by a linear growth in total TX power (i.e. constant power and bitrate per user), as opposed to M-ary PAM Single Input Single Output (SISO) systems, wherein a linear increase in bitrate requires approximately exponential increase in power (~3 dB optical per bit).
A more realistic
CG-ZMSW channel, whereby
H is drawn as a Circular Gaussian (CG)
Zero-Mean Spatially White (ZMSW) matrix, with CG i.i.d. elements. This CG-ZMSW channel model is ubiquitous in wireless transmission. Its approximate applicability to multimode fiber transmission was established in [
4
Y. Yadin and M. Orenstein, “Parallel optical interconnects over multimode waveguide,” J.Lightwave Technol.
24, 380–386 (2006). [CrossRef]
] and [
7
R. C. J. Hsu, A. Tarighat, A. Shah, A. H. Sayed, and B. Jalali, “Capacity enhancement in coherent optical MIMO (COMIMO) multimode fiber links,” IEEE Comm. Lett.
10, 195–197 (2006). [CrossRef]
].
The simulated BER performance vs. SNR for MIMO, MISO and SISO systems over the two types of channels is shown
Fig. 2 for a 64 mode fiber. An insightful comparative performance of the tradeoff between the spectral efficiency gain and the required SNR at a fixed BER level is displayed in
Fig. 2c, comparing ZFBF MIMO and MISO with conventional M-ary PAM SISO. Substantial multiplexing/diversity gain advantages are apparent for the ZFBF schemes.
6. Conclusions
ZFBF MIMO for MMF point-to-(multi)point configurations, provides substantial multiplexing gain (high spectral efficiencies) to simple uncoordinated receivers, at the expense of increased transmitter complexity and feedback from the receivers.
Fig. 3. (Color online) Performance of the ZFBF MIMO system described in
Fig. 2. (a,b):BER vs. optical SNR per bit (10 log
γU
) for MIMO, MISO, SISO over MMF with η =10 dB loss, for the ZMSW and Ortho. Ch. (a): ZFBF MISO outperforms SISO by ~6 dB, both providing 1 b/sym (b): ZFBF MIMO:
nU
= 2, 4, 8,16, 32 . The curves for
nU
= 4, 8 almost coincide.
Doubling the number of bits per symbol to 16, and then to 32 requires modest power increases in two steps of ~1 dB. (c): Spectral efficiency vs. SNR performance comparison of the novel ZFBF MIMO/MISO vs. multi-level SISO, for a fixed BER of 10
-4.