a b/Code/All PennyLane QML Demos/18 Many Body 208.1s kkawchak.ipynb
{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 43,
   "metadata": {
    "id": "in3VVDRqlMUE",
    "tags": []
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Time in seconds since beginning of run: 1693283058.6479008\n",
      "Tue Aug 29 04:24:18 2023\n"
     ]
    }
   ],
   "source": [
    "# This cell is added by sphinx-gallery\n",
    "# It can be customized to whatever you like\n",
    "%matplotlib inline\n",
    "# !pip install pennylane\n",
    "# !pip install neural_tangents\n",
    "# !pip install networkx\n",
    "import time\n",
    "seconds = time.time()\n",
    "print(\"Time in seconds since beginning of run:\", seconds)\n",
    "local_time = time.ctime(seconds)\n",
    "print(local_time)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "cfjAkbwNlMUF"
   },
   "source": [
    "Machine learning for quantum many-body problems\n",
    "===============================================\n",
    "\n",
    "::: {.meta}\n",
    ":property=\\\"og:description\\\": Machine learning for many-body problems\n",
    ":property=\\\"og:image\\\":\n",
    "<https://pennylane.ai/qml/_images/ml_classical_shadow.png>\n",
    ":::\n",
    "\n",
    "::: {.related}\n",
    "tutorial\\_classical\\_shadows Classical Shadows\n",
    "tutorial\\_kernel\\_based\\_training Kernel-based training with\n",
    "scikit-learn tutorial\\_kernels\\_module Training and evaluating quantum\n",
    "kernels\n",
    ":::\n",
    "\n",
    "*Author: Utkarsh Azad --- Posted: 02 May 2022. Last Updated: 09 May\n",
    "2022*\n",
    "\n",
    "Storing and processing a complete description of an $n$-qubit quantum\n",
    "mechanical system is challenging because the amount of memory required\n",
    "generally scales exponentially with the number of qubits. The quantum\n",
    "community has recently addressed this challenge by using the\n",
    "`classical shadow <tutorial_classical_shadows>`{.interpreted-text\n",
    "role=\"doc\"} formalism, which allows us to build more concise classical\n",
    "descriptions of quantum states using randomized single-qubit\n",
    "measurements. It was argued in Ref. that combining classical shadows\n",
    "with classical machine learning enables learning models that\n",
    "efficiently predict properties of quantum systems, such as the\n",
    "expectation value of a Hamiltonian, correlation functions, and\n",
    "entanglement entropies.\n",
    "\n",
    "![Combining machine learning and classical\n",
    "shadows](/demonstrations/ml_classical_shadows/class_shadow_ml.png){.align-center\n",
    "width=\"80.0%\"}\n",
    "\n",
    "In this demo, we describe one of the ideas presented in Ref. for using\n",
    "the classical shadow formalism and machine learning to predict the\n",
    "ground-state properties of the 2D antiferromagnetic Heisenberg model. We\n",
    "begin by learning how to build the Heisenberg model, calculate its\n",
    "ground-state properties, and compute its classical shadow. Finally, we\n",
    "demonstrate how to use\n",
    "`kernel-based learning models <tutorial_kernels_module>`{.interpreted-text\n",
    "role=\"doc\"} to predict ground-state properties from the learned\n",
    "classical shadows. So let's get started!\n",
    "\n",
    "Building the 2D Heisenberg Model\n",
    "--------------------------------\n",
    "\n",
    "We define a two-dimensional antiferromagnetic [Heisenberg\n",
    "model](https://en.wikipedia.org/wiki/Quantum_Heisenberg_model) on a\n",
    "square lattice, where a spin-1/2 particle occupies each site. The\n",
    "antiferromagnetic nature and the overall physics of this model depend on\n",
    "the couplings $J_{ij}$ present between the spins, as reflected in the\n",
    "Hamiltonian associated with the model:\n",
    "\n",
    "$$H = \\sum_{i < j} J_{ij}(X_i X_j + Y_i Y_j + Z_i Z_j) .$$\n",
    "\n",
    "Here, we consider the family of Hamiltonians where the couplings\n",
    "$J_{ij}$ between neighboring lattice sites are sampled uniformly from\n",
    "\\[0, 2\\] and all other couplings are zero. We build a coupling matrix\n",
    "$J$ by providing the number of rows $N_r$ and columns $N_c$ present in\n",
    "the square lattice. The dimensions of this matrix are $N_s \\times N_s$,\n",
    "where $N_s = N_r \\times N_c$ is the total number of spin particles\n",
    "present in the model.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 44,
   "metadata": {
    "id": "LWfXJdIBlMUG"
   },
   "outputs": [],
   "source": [
    "import itertools as it\n",
    "import pennylane.numpy as np\n",
    "import numpy as anp\n",
    "\n",
    "def build_coupling_mats(num_mats, num_rows, num_cols):\n",
    "    \"\"\"Build `num_mats` symmetric coupling matrices for a num_rows x num_cols lattice.\"\"\"\n",
    "    num_spins = num_rows * num_cols\n",
    "    coupling_mats = np.zeros((num_mats, num_spins, num_spins))\n",
    "    # one coupling per lattice edge, sampled uniformly from [0, 2] with a fixed seed\n",
    "    num_edges = 2 * num_rows * num_cols - num_rows - num_cols\n",
    "    coup_terms = anp.random.RandomState(24).uniform(0, 2, size=(num_mats, num_edges))\n",
    "    # populate edges to build the grid lattice: horizontal neighbors within\n",
    "    # the same row, or vertical neighbors exactly one row apart\n",
    "    edges = [(si, sj) for (si, sj) in it.combinations(range(num_spins), 2)\n",
    "                        if (sj % num_cols and sj - si == 1) or sj - si == num_cols]\n",
    "    for itr in range(num_mats):\n",
    "        for ((i, j), term) in zip(edges, coup_terms[itr]):\n",
    "            coupling_mats[itr][i][j] = coupling_mats[itr][j][i] = term\n",
    "    return coupling_mats"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "ZV-J9mFJlMUG"
   },
   "source": [
    "For this demo, we study a model with four spins arranged on the nodes of\n",
    "a square lattice. We require four qubits to simulate this model, one\n",
    "qubit for each spin. We start by building a coupling matrix `J_mat`\n",
    "using our previously defined function.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 45,
   "metadata": {
    "id": "GSqBR9p2lMUG"
   },
   "outputs": [],
   "source": [
    "Nr, Nc = 2, 2\n",
    "num_qubits = Nr * Nc  # Ns\n",
    "J_mat = build_coupling_mats(1, Nr, Nc)[0]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "LMJScKXHlMUG"
   },
   "source": [
    "We can now visualize the model instance by representing the coupling\n",
    "matrix as a `networkx` graph:\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 46,
   "metadata": {
    "colab": {
     "base_uri": "https://localhost:8080/",
     "height": 254
    },
    "id": "PnCxl2OUlMUH",
    "outputId": "8383df52-8539-489f-edfc-3064c3cf6eb0"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 400x400 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "import matplotlib.pyplot as plt\n",
    "import networkx as nx\n",
    "\n",
    "# from_numpy_array supersedes from_numpy_matrix, which was removed in NetworkX 3.0\n",
    "G = nx.from_numpy_array(anp.asarray(J_mat), create_using=nx.DiGraph)\n",
    "# place node i at (column, -row) so the drawing matches the lattice layout\n",
    "pos = {i: (i % Nc, -(i // Nc)) for i in G.nodes()}\n",
    "edge_labels = {(x, y): np.round(J_mat[x, y], 2) for x, y in G.edges()}\n",
    "weights = [x + 1.5 for x in list(nx.get_edge_attributes(G, \"weight\").values())]\n",
    "\n",
    "plt.figure(figsize=(4, 4))\n",
    "nx.draw(\n",
    "    G, pos, node_color=\"lightblue\", with_labels=True,\n",
    "    node_size=600, width=weights, edge_color=\"firebrick\",\n",
    ")\n",
    "nx.draw_networkx_edge_labels(G, pos=pos, edge_labels=edge_labels)\n",
    "plt.show()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "t0E6JAIIlMUH"
   },
   "source": [
    "We then use the same coupling matrix `J_mat` to obtain the Hamiltonian\n",
    "$H$ for the model we have instantiated above.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 47,
   "metadata": {
    "colab": {
     "base_uri": "https://localhost:8080/",
     "height": 0
    },
    "id": "A_XDopwwlMUH",
    "outputId": "adc70e0d-9aa5-4011-b669-1ef80c5451cd"
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Hamiltonian =\n",
      "  (0.44013459956570355) [X2 X3]\n",
      "+ (0.44013459956570355) [Y2 Y3]\n",
      "+ (0.44013459956570355) [Z2 Z3]\n",
      "+ (1.399024099899152) [X0 X2]\n",
      "+ (1.399024099899152) [Y0 Y2]\n",
      "+ (1.399024099899152) [Z0 Z2]\n",
      "+ (1.920034606671837) [X0 X1]\n",
      "+ (1.920034606671837) [Y0 Y1]\n",
      "+ (1.920034606671837) [Z0 Z1]\n",
      "+ (1.9997345852477584) [X1 X3]\n",
      "+ (1.9997345852477584) [Y1 Y3]\n",
      "+ (1.9997345852477584) [Z1 Z3]\n"
     ]
    }
   ],
   "source": [
    "import pennylane as qml\n",
    "\n",
    "def Hamiltonian(J_mat):\n",
    "    coeffs, ops = [], []\n",
    "    ns = J_mat.shape[0]\n",
    "    for i, j in it.combinations(range(ns), r=2):\n",
    "        coeff = J_mat[i, j]\n",
    "        if coeff:\n",
    "            for op in [qml.PauliX, qml.PauliY, qml.PauliZ]:\n",
    "                coeffs.append(coeff)\n",
    "                ops.append(op(i) @ op(j))\n",
    "    H = qml.Hamiltonian(coeffs, ops)\n",
    "    return H\n",
    "\n",
    "print(f\"Hamiltonian =\\n{Hamiltonian(J_mat)}\")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "JD6aZtNilMUH"
   },
   "source": [
    "For the Heisenberg model, a property of interest is usually the two-body\n",
    "correlation function $\\hat{C}_{ij}$, which for a pair of spins $i$ and $j$ is\n",
    "defined as the following operator:\n",
    "\n",
    "$$\\hat{C}_{ij} = \\frac{1}{3} (X_i X_j + Y_iY_j + Z_iZ_j).$$\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 48,
   "metadata": {
    "id": "UVBY4jUglMUH"
   },
   "outputs": [],
   "source": [
    "def corr_function(i, j):\n",
    "    # return the three Pauli terms of the correlation operator for spins i and j;\n",
    "    # the 1/3 prefactor is applied later, when the expectation values are summed\n",
    "    ops = []\n",
    "    for op in [qml.PauliX, qml.PauliY, qml.PauliZ]:\n",
    "        if i != j:\n",
    "            ops.append(op(i) @ op(j))\n",
    "        else:\n",
    "            ops.append(qml.Identity(i))\n",
    "    return ops"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "O0-ZNiNXlMUH"
   },
   "source": [
    "The expectation value of each such operator $\\hat{C}_{ij}$ with respect\n",
    "to the ground state $|\\psi_{0}\\rangle$ of the model can be used to build\n",
    "the correlation matrix $C$:\n",
    "\n",
    "$${C}_{ij} = \\langle \\hat{C}_{ij} \\rangle = \\frac{1}{3} \\langle \\psi_{0} | X_i X_j + Y_iY_j + Z_iZ_j | \\psi_{0} \\rangle .$$\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "VhgAsFyJlMUH"
   },
   "source": [
    "Hence, to build $C$ for the model, we need to calculate its ground state\n",
    "$|\\psi_{0}\\rangle$. We do this by diagonalizing the Hamiltonian for the\n",
    "model. Then, we obtain the eigenvector corresponding to the smallest\n",
    "eigenvalue.\n"
   ]
  },
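  {
   "cell_type": "code",
   "execution_count": 49,
   "metadata": {
    "id": "W-r82wjjlMUH"
   },
   "outputs": [],
   "source": [
    "import scipy as sp\n",
    "import scipy.sparse.linalg  # make sure the sparse linear-algebra submodule is loaded\n",
    "\n",
    "ham = Hamiltonian(J_mat)\n",
    "# the Hamiltonian is Hermitian, so use eigsh and request only the smallest\n",
    "# algebraic eigenvalue (which=\"SA\"), i.e. the ground-state energy\n",
    "eigvals, eigvecs = sp.sparse.linalg.eigsh(ham.sparse_matrix(), k=1, which=\"SA\")\n",
    "psi0 = eigvecs[:, np.argmin(eigvals)]"
   ]
  },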
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "Bft7PBtVlMUH"
   },
   "source": [
    "We then build a circuit that initializes the qubits into the ground\n",
    "state and measures the expectation value of the provided set of\n",
    "observables.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 50,
   "metadata": {
    "id": "aMmNdfzAlMUH"
   },
   "outputs": [],
   "source": [
    "dev_exact = qml.device(\"lightning.qubit\", wires=num_qubits) # for exact simulation\n",
    "\n",
    "def circuit(psi, observables):\n",
    "    psi = psi / np.linalg.norm(psi) # normalize the state\n",
    "    qml.QubitStateVector(psi, wires=range(num_qubits))\n",
    "    return [qml.expval(o) for o in observables]\n",
    "\n",
    "circuit_exact = qml.QNode(circuit, dev_exact)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "7_OTMOuMlMUH"
   },
   "source": [
    "Finally, we execute this circuit to obtain the exact correlation matrix\n",
    "$C$. We compute the correlation operators $\\hat{C}_{ij}$ and their\n",
    "expectation values with respect to the ground state $|\\psi_0\\rangle$.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 51,
   "metadata": {
    "id": "M2SqFe0IlMUI"
   },
   "outputs": [],
   "source": [
    "coups = list(it.product(range(num_qubits), repeat=2))\n",
    "corrs = [corr_function(i, j) for i, j in coups]\n",
    "\n",
    "def build_exact_corrmat(coups, corrs, circuit, psi):\n",
    "    corr_mat_exact = np.zeros((num_qubits, num_qubits))\n",
    "    for idx, (i, j) in enumerate(coups):\n",
    "        corr = corrs[idx]\n",
    "        if i == j:\n",
    "            corr_mat_exact[i][j] = 1.0\n",
    "        else:\n",
    "            corr_mat_exact[i][j] = (\n",
    "                np.sum(np.array([circuit(psi, observables=[o]) for o in corr]).T) / 3\n",
    "            )\n",
    "            corr_mat_exact[j][i] = corr_mat_exact[i][j]\n",
    "    return corr_mat_exact\n",
    "\n",
    "expval_exact = build_exact_corrmat(coups, corrs, circuit_exact, psi0)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "wRhl6GSvlMUI"
   },
   "source": [
    "Once built, we can visualize the correlation matrix:\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 52,
   "metadata": {
    "colab": {
     "base_uri": "https://localhost:8080/",
     "height": 337
    },
    "id": "elPohZB_lMUI",
    "outputId": "de61464c-fc27-4900-bd66-880b847547f7"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 400x400 with 2 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "fig, ax = plt.subplots(1, 1, figsize=(4, 4))\n",
    "im = ax.imshow(expval_exact, cmap=plt.get_cmap(\"RdBu\"), vmin=-1, vmax=1)\n",
    "ax.xaxis.set_ticks(range(num_qubits))\n",
    "ax.yaxis.set_ticks(range(num_qubits))\n",
    "ax.xaxis.set_tick_params(labelsize=14)\n",
    "ax.yaxis.set_tick_params(labelsize=14)\n",
    "ax.set_title(\"Exact Correlation Matrix\", fontsize=14)\n",
    "\n",
    "bar = fig.colorbar(im, pad=0.05, shrink=0.80)\n",
    "bar.set_label(r\"$C_{ij}$\", fontsize=14, rotation=0)\n",
    "bar.ax.tick_params(labelsize=14)\n",
    "plt.show()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "X7AxsPAnlMUI"
   },
   "source": [
    "Constructing Classical Shadows\n",
    "==============================\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "NTPT4XUFlMUI"
   },
   "source": [
    "Now that we have built the Heisenberg model, the next step is to\n",
    "construct a\n",
    "`classical shadow <tutorial_classical_shadows>`{.interpreted-text\n",
    "role=\"doc\"} representation for its ground state. To construct an\n",
    "approximate classical representation of an $n$-qubit quantum state\n",
    "$\\rho$, we perform randomized single-qubit measurements on $T$ copies of\n",
    "$\\rho$. Each qubit is measured in a basis chosen randomly among the Pauli\n",
    "bases $X$, $Y$, or $Z$, which yields $n$ random pure single-qubit states\n",
    "$|s_i^{(t)}\\rangle$ for each copy $t$:\n",
    "\n",
    "$$|s_{i}^{(t)}\\rangle \\in \\{|0\\rangle, |1\\rangle, |+\\rangle, |-\\rangle, |i+\\rangle, |i-\\rangle\\}.$$\n",
    "\n",
    "$$S_T(\\rho) = \\big\\{|s_{i}^{(t)}\\rangle: i\\in\\{1,\\ldots, n\\},\\ t\\in\\{1,\\ldots, T\\} \\big\\}.$$\n",
    "\n",
    "Each of the $|s_i^{(t)}\\rangle$ provides us with a snapshot of the state\n",
    "$\\rho$, and the $nT$ measurements yield the complete set $S_{T}$, which\n",
    "requires just $3nT$ bits to be stored in classical memory. This is\n",
    "discussed in further detail in our previous demo about\n",
    "`classical shadows <tutorial_classical_shadows>`{.interpreted-text\n",
    "role=\"doc\"}.\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "voRlU_8blMUI"
   },
   "source": [
    "![](/demonstrations/ml_classical_shadows/class_shadow_prep.png){.align-center\n",
    "width=\"100.0%\"}\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "nJd4CxtClMUI"
   },
   "source": [
    "To prepare a classical shadow for the ground state of the Heisenberg\n",
    "model, we simply reuse the circuit template used above and reconstruct a\n",
    "`QNode` utilizing a device that performs single-shot measurements.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 53,
   "metadata": {
    "id": "puR1Cx8WlMUI"
   },
   "outputs": [],
   "source": [
    "dev_oshot = qml.device(\"lightning.qubit\", wires=num_qubits, shots=1)\n",
    "circuit_oshot = qml.QNode(circuit, dev_oshot)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "729nluzslMUI"
   },
   "source": [
    "Now, we define a function to build the classical shadow for the quantum\n",
    "state prepared by a given $n$-qubit circuit, using randomized Pauli basis\n",
    "measurements on $T$ copies of the state.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 54,
   "metadata": {
    "colab": {
     "base_uri": "https://localhost:8080/",
     "height": 0
    },
    "id": "2lKMEDwRlMUI",
    "outputId": "73e9bbda-afdd-4cee-fce0-2c611ca063e7"
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "First five measurement outcomes =\n",
      " [[-1.  1.  1. -1.]\n",
      " [ 1. -1. -1.  1.]\n",
      " [ 1. -1. -1.  1.]\n",
      " [ 1.  1.  1. -1.]\n",
      " [-1. -1. -1.  1.]]\n",
      "First five measurement bases =\n",
      " [[2 2 2 1]\n",
      " [0 1 0 0]\n",
      " [2 2 2 2]\n",
      " [1 2 2 2]\n",
      " [1 2 2 1]]\n"
     ]
    }
   ],
   "source": [
    "def gen_class_shadow(circ_template, circuit_params, num_shadows, num_qubits):\n",
    "    # prepare the complete set of available Pauli operators\n",
    "    unitary_ops = [qml.PauliX, qml.PauliY, qml.PauliZ]\n",
    "    # sample random Pauli measurements uniformly (0 = X, 1 = Y, 2 = Z)\n",
    "    unitary_ensmb = np.random.randint(0, 3, size=(num_shadows, num_qubits), dtype=int)\n",
    "\n",
    "    outcomes = np.zeros((num_shadows, num_qubits))\n",
    "    for ns in range(num_shadows):\n",
    "        # for each snapshot, extract the Pauli basis measurement to be performed\n",
    "        meas_obs = [unitary_ops[unitary_ensmb[ns, i]](i) for i in range(num_qubits)]\n",
    "        # perform a single-shot randomized Pauli measurement for each qubit\n",
    "        outcomes[ns, :] = circ_template(circuit_params, observables=meas_obs)\n",
    "\n",
    "    return outcomes, unitary_ensmb\n",
    "\n",
    "\n",
    "outcomes, basis = gen_class_shadow(circuit_oshot, psi0, 100, num_qubits)\n",
    "print(\"First five measurement outcomes =\\n\", outcomes[:5])\n",
    "print(\"First five measurement bases =\\n\", basis[:5])"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "aM9kSgoElMUI"
   },
   "source": [
    "Furthermore, $S_{T}$ can be used to construct an approximation of the\n",
    "underlying $n$-qubit state $\\rho$ by averaging over the $T$ snapshots:\n",
    "\n",
    "$$\\sigma_T(\\rho) = \\frac{1}{T} \\sum_{t=1}^{T} \\big(3|s_{1}^{(t)}\\rangle\\langle s_1^{(t)}| - \\mathbb{I}\\big)\\otimes \\ldots \\otimes \\big(3|s_{n}^{(t)}\\rangle\\langle s_n^{(t)}| - \\mathbb{I}\\big).$$\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 55,
   "metadata": {
    "id": "uhlXzt7IlMUI"
   },
   "outputs": [],
   "source": [
    "def snapshot_state(meas_list, obs_list):\n",
    "    # undo the rotations done for performing Pauli measurements in the specific basis\n",
    "    rotations = [\n",
    "        qml.matrix(qml.Hadamard(wires=0)), # X-basis\n",
    "        qml.matrix(qml.Hadamard(wires=0)) @ qml.matrix(qml.adjoint(qml.S(wires=0))), # Y-basis\n",
    "        qml.matrix(qml.Identity(wires=0)), # Z-basis\n",
    "    ]\n",
    "\n",
    "    # reconstruct snapshot from local Pauli measurements\n",
    "    rho_snapshot = [1]\n",
    "    for meas_out, basis in zip(meas_list, obs_list):\n",
    "        # preparing state |s_i><s_i| using the post measurement outcome:\n",
    "        # |0><0| for 1 and |1><1| for -1\n",
    "        state = np.array([[1, 0], [0, 0]]) if meas_out == 1 else np.array([[0, 0], [0, 1]])\n",
    "        local_rho = 3 * (rotations[basis].conj().T @ state @ rotations[basis]) - np.eye(2)\n",
    "        rho_snapshot = np.kron(rho_snapshot, local_rho)\n",
    "\n",
    "    return rho_snapshot\n",
    "\n",
    "def shadow_state_reconst(shadow):\n",
    "    num_snapshots, num_qubits = shadow[0].shape\n",
    "    meas_lists, obs_lists = shadow\n",
    "\n",
    "    # Reconstruct the quantum state from its classical shadow\n",
    "    shadow_rho = np.zeros((2 ** num_qubits, 2 ** num_qubits), dtype=complex)\n",
    "    for i in range(num_snapshots):\n",
    "        shadow_rho += snapshot_state(meas_lists[i], obs_lists[i])\n",
    "\n",
    "    return shadow_rho / num_snapshots"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "wB7CZQJXlMUI"
   },
   "source": [
    "To see how well the reconstruction works for different values of $T$, we\n",
    "look at the\n",
    "[fidelity](https://en.wikipedia.org/wiki/Fidelity_of_quantum_states) of\n",
    "the actual quantum state with respect to the quantum state reconstructed\n",
    "from the classical shadow with $T$ copies. On average, as the number of\n",
    "copies $T$ is increased, the reconstruction becomes more effective, with\n",
    "higher average fidelity values (orange) and lower variance (blue).\n",
    "Eventually, in the limit $T\\rightarrow\\infty$, the reconstruction will\n",
    "be exact.\n",
    "\n",
    "![Fidelity of the reconstructed ground state with different shadow sizes\n",
    "$T$](/demonstrations/ml_classical_shadows/fidel_snapshot.png){.align-center\n",
    "width=\"80.0%\"}\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "KuzuhucBlMUI"
   },
   "source": [
    "The reconstructed quantum state $\\sigma_T$ can also be used to evaluate\n",
    "expectation values $\\text{Tr}(O\\sigma_T)$ for some localized observable\n",
    "$O = \\bigotimes_{i=1}^{n} P_i$, where $P_i \\in \\{I, X, Y, Z\\}$. However,\n",
    "as shown above, $\\sigma_T$ would be only an approximation of $\\rho$ for\n",
    "finite values of $T$. Therefore, to estimate $\\langle O \\rangle$\n",
    "robustly, we use median-of-means estimation. For this purpose, we\n",
    "split up the $T$ shadows into $K$ equally-sized groups, compute the mean\n",
    "value of $\\langle O \\rangle$ within each group, and take the median of\n",
    "these group means.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 56,
   "metadata": {
    "id": "29rEs_S1lMUI"
   },
   "outputs": [],
   "source": [
    "def estimate_shadow_obs(shadow, observable, k=10):\n",
    "    shadow_size = shadow[0].shape[0]\n",
    "\n",
    "    # convert PennyLane observables to indices\n",
    "    map_name_to_int = {\"PauliX\": 0, \"PauliY\": 1, \"PauliZ\": 2}\n",
    "    if isinstance(observable, (qml.PauliX, qml.PauliY, qml.PauliZ)):\n",
    "        target_obs = np.array([map_name_to_int[observable.name]])\n",
    "        target_locs = np.array([observable.wires[0]])\n",
    "    else:\n",
    "        target_obs = np.array([map_name_to_int[o.name] for o in observable.obs])\n",
    "        target_locs = np.array([o.wires[0] for o in observable.obs])\n",
    "\n",
    "    # perform median of means to return the result\n",
    "    means = []\n",
    "    meas_list, obs_lists = shadow\n",
    "    for i in range(0, shadow_size, shadow_size // k):\n",
    "        meas_list_k, obs_lists_k = (\n",
    "            meas_list[i : i + shadow_size // k],\n",
    "            obs_lists[i : i + shadow_size // k],\n",
    "        )\n",
    "        # keep only snapshots whose measurement bases match the target observable\n",
    "        indices = np.all(obs_lists_k[:, target_locs] == target_obs, axis=1)\n",
    "        if sum(indices):\n",
    "            means.append(\n",
    "                np.sum(np.prod(meas_list_k[indices][:, target_locs], axis=1)) / sum(indices)\n",
    "            )\n",
    "        else:\n",
    "            means.append(0)\n",
    "\n",
    "    return np.median(means)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "MJrMeks1lMUI"
   },
   "source": [
    "Now we estimate the correlation matrix $C^{\\prime}$ from the classical\n",
    "shadow approximation of the ground state.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 57,
   "metadata": {
    "id": "rjSaQJwYlMUI"
   },
   "outputs": [],
   "source": [
    "coups = list(it.product(range(num_qubits), repeat=2))\n",
    "corrs = [corr_function(i, j) for i, j in coups]\n",
    "qbobs = [qob for qobs in corrs for qob in qobs]\n",
    "\n",
    "def build_estim_corrmat(coups, corrs, num_obs, shadow):\n",
    "    k = int(2 * np.log(2 * num_obs)) # sets the number of groups (k + 1) for median of means\n",
    "    corr_mat_estim = np.zeros((num_qubits, num_qubits))\n",
    "    for idx, (i, j) in enumerate(coups):\n",
    "        corr = corrs[idx]\n",
    "        if i == j:\n",
    "            corr_mat_estim[i][j] = 1.0\n",
    "        else:\n",
    "            corr_mat_estim[i][j] = (\n",
    "                np.sum(np.array([estimate_shadow_obs(shadow, o, k=k+1) for o in corr])) / 3\n",
    "            )\n",
    "            corr_mat_estim[j][i] = corr_mat_estim[i][j]\n",
    "    return corr_mat_estim\n",
    "\n",
    "shadow = gen_class_shadow(circuit_oshot, psi0, 1000, num_qubits)\n",
    "expval_estmt = build_estim_corrmat(coups, corrs, len(qbobs), shadow)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "id": "AtpcKs2dlMUJ"
   },
   "source": [
    "This time, let us visualize the deviation between the exact\n",
    "correlation matrix ($C$) and the estimated correlation matrix\n",
    "($C^{\\prime}$) to assess the effectiveness of the classical shadow\n",
    "formalism.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 58,
   "metadata": {
    "colab": {
     "base_uri": "https://localhost:8080/",
     "height": 371
    },
    "id": "MAviSXyklMUJ",
    "outputId": "9187eaa8-7340-49b6-91ac-871d95d1c191"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 420x400 with 2 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "fig, ax = plt.subplots(1, 1, figsize=(4.2, 4))\n",
    "im = ax.imshow(expval_exact-expval_estmt, cmap=plt.get_cmap(\"RdBu\"), vmin=-1, vmax=1)\n",
    "ax.xaxis.set_ticks(range(num_qubits))\n",
    "ax.yaxis.set_ticks(range(num_qubits))\n",
    "ax.xaxis.set_tick_params(labelsize=14)\n",
    "ax.yaxis.set_tick_params(labelsize=14)\n",
    "ax.set_title(\"Error in estimating the\\ncorrelation matrix\", fontsize=14)\n",
    "\n",
    "bar = fig.colorbar(im, pad=0.05, shrink=0.80)\n",
    "bar.set_label(r\"$\\Delta C_{ij}$\", fontsize=14, rotation=0)\n",
    "bar.ax.tick_params(labelsize=14)\n",
    "plt.show()"
   ]
  },
806
  {
807
   "cell_type": "markdown",
808
   "metadata": {
809
    "id": "CvaBmbD-lMUJ"
810
   },
811
   "source": [
812
    "Training Classical Machine Learning Models\n",
813
    "==========================================\n"
814
   ]
815
  },
816
  {
817
   "cell_type": "markdown",
818
   "metadata": {
819
    "id": "P9L3yo0tlMUJ"
820
   },
821
   "source": [
822
    "There are multiple ways in which we can combine classical shadows and\n",
823
    "machine learning. This could include training a model to learn the\n",
824
    "classical representation of quantum systems based on some system\n",
825
    "parameter, estimating a property from such learned classical\n",
826
    "representations, or a combination of both. In our case, we consider the\n",
827
    "problem of using\n",
828
    "`kernel-based models <tutorial_kernel_based_training>`{.interpreted-text\n",
829
    "role=\"doc\"} to learn the ground-state representation of the Heisenberg\n",
830
    "model Hamiltonian $H(x_l)$ from the coupling vector $x_l$, where\n",
831
    "$x_l = [J_{i,j} \\text{ for } i < j]$. The goal is to predict the\n",
832
    "correlation functions $C_{ij}$:\n",
833
    "\n",
834
    "$$\\big\\{x_l \\rightarrow \\sigma_T(\\rho(x_l)) \\rightarrow \\text{Tr}(\\hat{C}_{ij} \\sigma_T(\\rho(x_l))) \\big\\}_{l=1}^{N}.$$\n",
835
    "\n",
836
    "Here, we consider the following kernel-based machine learning model:\n",
837
    "\n",
838
    "$$\\hat{\\sigma}_{N} (x) = \\sum_{l=1}^{N} \\kappa(x, x_l)\\sigma_T (x_l) = \\sum_{l=1}^{N} \\left(\\sum_{l^{\\prime}=1}^{N} k(x, x_{l^{\\prime}})(K+\\lambda I)^{-1}_{l, l^{\\prime}} \\sigma_T(x_l) \\right),$$\n",
839
    "\n",
    "where $\\lambda > 0$ is a regularization parameter that keeps $K + \\lambda I$ invertible even when $K$ is\n",
841
    "not invertible, $\\sigma_T(x_l)$ denotes the classical representation of\n",
842
    "the ground state $\\rho(x_l)$ of the Heisenberg model constructed using\n",
843
    "$T$ randomized Pauli measurements, and $K_{ij}=k(x_i, x_j)$ is the\n",
844
    "kernel matrix with $k(x, x^{\\prime})$ as the kernel function.\n",
845
    "\n",
846
    "Similarly, estimating an expectation value on the predicted ground state\n",
    "$\\hat{\\sigma}_{N}(x)$ using the trained model can then be done by evaluating:\n",
848
    "\n",
    "$$\\text{Tr}(\\hat{O} \\hat{\\sigma}_{N} (x)) = \\sum_{l=1}^{N} \\kappa(x, x_l)\\text{Tr}(\\hat{O}\\sigma_T (x_l)).$$\n",
850
    "\n",
851
    "We train the classical kernel-based models using $N = 70$ randomly\n",
852
    "chosen values of the coupling matrices $J$.\n"
853
   ]
854
  },
855
  {
856
   "cell_type": "code",
857
   "execution_count": 59,
858
   "metadata": {
859
    "id": "awX0qjXKlMUJ"
860
   },
861
   "outputs": [],
862
   "source": [
863
    "# imports for ML methods and techniques\n",
864
    "from sklearn.model_selection import train_test_split, cross_val_score\n",
865
    "from sklearn import svm\n",
866
    "from sklearn.kernel_ridge import KernelRidge"
867
   ]
868
  },
869
  {
870
   "cell_type": "markdown",
871
   "metadata": {
872
    "id": "bQWsJyqGlMUJ"
873
   },
874
   "source": [
875
    "First, to build the dataset, we use the function `build_dataset` that\n",
876
    "takes as input the size of the dataset (`num_points`), the topology of\n",
877
    "the lattice (`Nr` and `Nc`), and the number of randomized Pauli\n",
878
    "measurements ($T$) for the construction of classical shadows. The\n",
879
    "`X_data` is the set of coupling vectors that are defined as a stripped\n",
880
    "version of the coupling matrix $J$, where only non-duplicate and\n",
    "non-zero $J_{ij}$ are considered. The `y_exact` and `y_estim` are the\n",
882
    "set of correlation vectors, i.e., the flattened correlation matrix $C$,\n",
883
    "computed with respect to the ground-state obtained from exact\n",
884
    "diagonalization and classical shadow representation (with $T=500$),\n",
885
    "respectively.\n"
886
   ]
887
  },
888
  {
889
   "cell_type": "code",
890
   "execution_count": 60,
891
   "metadata": {
892
    "colab": {
893
     "base_uri": "https://localhost:8080/",
894
     "height": 0
895
    },
896
    "id": "oNjK4cvrlMUJ",
897
    "outputId": "64d55c2a-9d38-4349-dbd6-04ab57db94fc"
898
   },
899
   "outputs": [
900
    {
901
     "data": {
902
      "text/plain": [
903
       "((100, 4), (100, 16), (100, 16))"
904
      ]
905
     },
906
     "execution_count": 60,
907
     "metadata": {},
908
     "output_type": "execute_result"
909
    }
910
   ],
911
   "source": [
912
    "def build_dataset(num_points, Nr, Nc, T=500):\n",
913
    "\n",
914
    "    num_qubits = Nr * Nc\n",
915
    "    X, y_exact, y_estim = [], [], []\n",
916
    "    coupling_mats = build_coupling_mats(num_points, Nr, Nc)\n",
917
    "\n",
918
    "    for coupling_mat in coupling_mats:\n",
919
    "        ham = Hamiltonian(coupling_mat)\n",
920
    "        eigvals, eigvecs = sp.sparse.linalg.eigs(ham.sparse_matrix())\n",
921
    "        psi = eigvecs[:, np.argmin(eigvals)]\n",
922
    "        shadow = gen_class_shadow(circuit_oshot, psi, T, num_qubits)\n",
923
    "\n",
924
    "        coups = list(it.product(range(num_qubits), repeat=2))\n",
925
    "        corrs = [corr_function(i, j) for i, j in coups]\n",
926
    "        qbobs = [x for sublist in corrs for x in sublist]\n",
927
    "\n",
928
    "        expval_exact = build_exact_corrmat(coups, corrs, circuit_exact, psi)\n",
929
    "        expval_estim = build_estim_corrmat(coups, corrs, len(qbobs), shadow)\n",
930
    "\n",
931
    "        coupling_vec = []\n",
932
    "        for coup in coupling_mat.reshape(1, -1)[0]:\n",
933
    "            if coup and coup not in coupling_vec:\n",
934
    "                coupling_vec.append(coup)\n",
935
    "        coupling_vec = np.array(coupling_vec) / np.linalg.norm(coupling_vec)\n",
936
    "\n",
937
    "        X.append(coupling_vec)\n",
938
    "        y_exact.append(expval_exact.reshape(1, -1)[0])\n",
939
    "        y_estim.append(expval_estim.reshape(1, -1)[0])\n",
940
    "\n",
941
    "    return np.array(X), np.array(y_exact), np.array(y_estim)\n",
942
    "\n",
943
    "X, y_exact, y_estim = build_dataset(100, Nr, Nc, 500)\n",
944
    "X_data, y_data = X, y_estim\n",
945
    "X_data.shape, y_data.shape, y_exact.shape"
946
   ]
947
  },
948
  {
949
   "cell_type": "markdown",
950
   "metadata": {
951
    "id": "V18wEzs_lMUK"
952
   },
953
   "source": [
954
    "Now that our dataset is ready, we can shift our focus to the ML models.\n",
955
    "Here, we use two different Kernel functions: (i) Gaussian Kernel and\n",
956
    "(ii) Neural Tangent Kernel. For both of them, we consider the\n",
957
    "regularization parameter $\\lambda$ from the following set of values:\n",
958
    "\n",
959
    "$$\\lambda = \\left\\{ 0.0025, 0.0125, 0.025, 0.05, 0.125, 0.25, 0.5, 1.0, 5.0, 10.0 \\right\\}.$$\n",
960
    "\n",
961
    "Next, we define the kernel functions $k(x, x^{\\prime})$ for each of the\n",
962
    "mentioned kernels:\n"
963
   ]
964
  },
965
  {
966
   "cell_type": "markdown",
967
   "metadata": {
968
    "id": "Seuq-zVflMUK"
969
   },
970
   "source": [
971
    "$$k(x, x^{\\prime}) = e^{-\\gamma||x - x^{\\prime}||^{2}_{2}}. \\tag{Gaussian Kernel}$$\n",
972
    "\n",
973
    "For the Gaussian kernel, the hyperparameter\n",
974
    "$\\gamma = N^{2}/\\sum_{i=1}^{N} \\sum_{j=1}^{N} ||x_i-x_j||^{2}_{2} > 0$\n",
    "is chosen to be the inverse of the average squared Euclidean distance between $x_i$ and\n",
976
    "$x_j$. The kernel is implemented using the radial-basis function (rbf)\n",
977
    "kernel in the `sklearn` library.\n"
978
   ]
979
  },
980
  {
981
   "cell_type": "markdown",
982
   "metadata": {
983
    "id": "38VnvOMGlMUK"
984
   },
985
   "source": [
986
    "$$k(x, x^{\\prime}) = k^{\\text{NTK}}(x, x^{\\prime}). \\tag{Neural Tangent Kernel}$$\n",
987
    "\n",
988
    "The neural tangent kernel $k^{\\text{NTK}}$ used here is equivalent to an\n",
989
    "infinite-width feed-forward neural network with four hidden layers and\n",
    "the rectified linear unit (ReLU) as the activation function.\n",
991
    "This is implemented using the `neural_tangents` library.\n"
992
   ]
993
  },
994
  {
995
   "cell_type": "code",
996
   "execution_count": 61,
997
   "metadata": {
998
    "colab": {
999
     "base_uri": "https://localhost:8080/",
1000
     "height": 0
1001
    },
1002
    "id": "Ekn9atIZlMUK",
1003
    "outputId": "5b0e44dc-a7aa-40a6-f945-2b56e0c57df4"
1004
   },
1005
   "outputs": [],
1006
   "source": [
1007
    "from neural_tangents import stax\n",
1008
    "init_fn, apply_fn, kernel_fn = stax.serial(\n",
1009
    "    stax.Dense(32),\n",
1010
    "    stax.Relu(),\n",
1011
    "    stax.Dense(32),\n",
1012
    "    stax.Relu(),\n",
1013
    "    stax.Dense(32),\n",
1014
    "    stax.Relu(),\n",
1015
    "    stax.Dense(32),\n",
1016
    "    stax.Relu(),\n",
1017
    "    stax.Dense(1),\n",
1018
    ")\n",
1019
    "kernel_NN = kernel_fn(X_data, X_data, \"ntk\")\n",
1020
    "\n",
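    "# Normalize the NTK matrix: K_ij / sqrt(K_ii * K_jj). JAX arrays are immutable\n",
    "# and `.at[i, j].set(...)` returns a new array, so the result must be assigned.\n",
    "import jax.numpy as jnp\n",
    "\n",
    "diag_sqrt = jnp.sqrt(jnp.diag(kernel_NN))\n",
    "kernel_NN = kernel_NN / jnp.outer(diag_sqrt, diag_sqrt)"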
1024
   ]
1025
  },
1026
  {
1027
   "cell_type": "markdown",
1028
   "metadata": {
1029
    "id": "GJbMkW7klMUK"
1030
   },
1031
   "source": [
1032
    "For the above two defined kernel methods, we obtain the best learning\n",
1033
    "model by performing hyperparameter tuning using cross-validation for the\n",
1034
    "prediction task of each $C_{ij}$. For this purpose, we implement the\n",
    "function `fit_predict_data`, which takes as input the correlation\n",
1036
    "function index `cij`, kernel matrix `kernel`, and internal kernel\n",
1037
    "mapping `opt` required by the kernel-based regression models from the\n",
1038
    "`sklearn` library.\n"
1039
   ]
1040
  },
1041
  {
1042
   "cell_type": "code",
1043
   "execution_count": 62,
1044
   "metadata": {
1045
    "id": "7qbJKRGglMUK"
1046
   },
1047
   "outputs": [],
1048
   "source": [
1049
    "from sklearn.metrics import mean_squared_error\n",
1050
    "\n",
1051
    "def fit_predict_data(cij, kernel, opt=\"linear\"):\n",
1052
    "\n",
1053
    "    # training data (estimated from measurement data)\n",
1054
    "    y = np.array([y_estim[i][cij] for i in range(len(X_data))])\n",
1055
    "    X_train, X_test, y_train, y_test = train_test_split(\n",
1056
    "        kernel, y, test_size=0.3, random_state=24\n",
1057
    "    )\n",
1058
    "\n",
1059
    "    # testing data (exact expectation values)\n",
1060
    "    y_clean = np.array([y_exact[i][cij] for i in range(len(X_data))])\n",
1061
    "    _, _, _, y_test_clean = train_test_split(kernel, y_clean, test_size=0.3, random_state=24)\n",
1062
    "\n",
1063
    "    # hyperparameter tuning with cross validation\n",
1064
    "    models = [\n",
1065
    "        # Epsilon-Support Vector Regression\n",
1066
    "        (lambda Cx: svm.SVR(kernel=opt, C=Cx, epsilon=0.1)),\n",
1067
    "        # Kernel-Ridge based Regression\n",
1068
    "        (lambda Cx: KernelRidge(kernel=opt, alpha=1 / (2 * Cx))),\n",
1069
    "    ]\n",
1070
    "\n",
1071
    "    # Regularization parameter\n",
1072
    "    hyperparams = [0.0025, 0.0125, 0.025, 0.05, 0.125, 0.25, 0.5, 1.0, 5.0, 10.0]\n",
1073
    "    best_pred, best_cv_score, best_test_score = None, np.inf, np.inf\n",
1074
    "    for model in models:\n",
1075
    "        for hyperparam in hyperparams:\n",
1076
    "            cv_score = -np.mean(\n",
1077
    "                cross_val_score(\n",
1078
    "                    model(hyperparam), X_train, y_train, cv=5,\n",
1079
    "                    scoring=\"neg_root_mean_squared_error\",\n",
1080
    "                )\n",
1081
    "            )\n",
1082
    "            if best_cv_score > cv_score:\n",
1083
    "                best_model = model(hyperparam).fit(X_train, y_train)\n",
1084
    "                best_pred = best_model.predict(X_test)\n",
1085
    "                best_cv_score = cv_score\n",
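    "                # RMSE against the exact values; np.sqrt(MSE) avoids the `squared`\n",
    "                # keyword, which newer scikit-learn releases no longer accept\n",
    "                best_test_score = np.sqrt(\n",
    "                    mean_squared_error(y_test_clean.ravel(), best_pred.ravel())\n",
    "                )\n",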
1089
    "\n",
1090
    "    return (\n",
1091
    "        best_pred, y_test_clean, np.round(best_cv_score, 5), np.round(best_test_score, 5)\n",
1092
    "    )"
1093
   ]
1094
  },
1095
  {
1096
   "cell_type": "markdown",
1097
   "metadata": {
1098
    "id": "IGPcW5sglMUK"
1099
   },
1100
   "source": [
1101
    "We perform the fitting and prediction for each $C_{ij}$ and print the\n",
1102
    "output in a tabular format.\n"
1103
   ]
1104
  },
1105
  {
1106
   "cell_type": "code",
1107
   "execution_count": 63,
1108
   "metadata": {
1109
    "colab": {
1110
     "base_uri": "https://localhost:8080/",
1111
     "height": 0
1112
    },
1113
    "id": "vxZOUxk9lMUK",
1114
    "outputId": "07ed6444-ca91-48c2-fe8c-64b9bba549c8"
1115
   },
1116
   "outputs": [
1117
    {
1118
     "name": "stdout",
1119
     "output_type": "stream",
1120
     "text": [
1121
      "              Correlation                    Gaussian kernel              Neural Tangent kernel\n",
1122
      "               \t C_00 \t|                           [-0.  0.]                          [-0.  0.]\n",
1123
      "               \t C_01 \t|                   [0.08991 0.07814]                  [0.1165  0.08399]\n",
1124
      "               \t C_02 \t|                   [0.10099 0.06564]                  [0.10664 0.08351]\n",
1125
      "               \t C_03 \t|                   [0.09774 0.04564]                  [0.10384 0.06656]\n",
1126
      "               \t C_10 \t|                   [0.08991 0.07814]                  [0.1165  0.08399]\n",
1127
      "               \t C_11 \t|                           [-0.  0.]                          [-0.  0.]\n",
1128
      "               \t C_12 \t|                   [0.11993 0.0337 ]                  [0.1305  0.06696]\n",
1129
      "               \t C_13 \t|                   [0.09644 0.05856]                  [0.0995  0.08167]\n",
1130
      "               \t C_20 \t|                   [0.10099 0.06564]                  [0.10664 0.08351]\n",
1131
      "               \t C_21 \t|                   [0.11993 0.0337 ]                  [0.1305  0.06696]\n",
1132
      "               \t C_22 \t|                           [-0.  0.]                          [-0.  0.]\n",
1133
      "               \t C_23 \t|                   [0.101   0.06974]                  [0.1226  0.07248]\n",
1134
      "               \t C_30 \t|                   [0.09774 0.04564]                  [0.10384 0.06656]\n",
1135
      "               \t C_31 \t|                   [0.09644 0.05856]                  [0.0995  0.08167]\n",
1136
      "               \t C_32 \t|                   [0.101   0.06974]                  [0.1226  0.07248]\n",
1137
      "               \t C_33 \t|                           [-0.  0.]                          [-0.  0.]\n"
1138
     ]
1139
    }
1140
   ],
1141
   "source": [
1142
    "kernel_list = [\"Gaussian kernel\", \"Neural Tangent kernel\"]\n",
1143
    "kernel_data = np.zeros((num_qubits ** 2, len(kernel_list), 2))\n",
1144
    "y_predclean, y_predicts1, y_predicts2 = [], [], []\n",
1145
    "\n",
1146
    "for cij in range(num_qubits ** 2):\n",
1147
    "    y_predict, y_clean, cv_score, test_score = fit_predict_data(cij, X_data, opt=\"rbf\")\n",
1148
    "    y_predclean.append(y_clean)\n",
1149
    "    kernel_data[cij][0] = (cv_score, test_score)\n",
1150
    "    y_predicts1.append(y_predict)\n",
1151
    "    y_predict, y_clean, cv_score, test_score = fit_predict_data(cij, kernel_NN)\n",
1152
    "    kernel_data[cij][1] = (cv_score, test_score)\n",
1153
    "    y_predicts2.append(y_predict)\n",
1154
    "\n",
1155
    "# For each C_ij print (best_cv_score, test_score) pair\n",
1156
    "row_format = \"{:>25}{:>35}{:>35}\"\n",
1157
    "print(row_format.format(\"Correlation\", *kernel_list))\n",
1158
    "for idx, data in enumerate(kernel_data):\n",
1159
    "    print(\n",
1160
    "        row_format.format(\n",
1161
    "            f\"\\t C_{idx//num_qubits}{idx%num_qubits} \\t| \",\n",
1162
    "            str(data[0]),\n",
1163
    "            str(data[1]),\n",
1164
    "        )\n",
1165
    "    )"
1166
   ]
1167
  },
1168
  {
1169
   "cell_type": "markdown",
1170
   "metadata": {
1171
    "id": "W8OQJN7blMUK"
1172
   },
1173
   "source": [
1174
    "Overall, we find that the models with the Gaussian kernel performed\n",
1175
    "better than those with NTK for predicting the expectation value of the\n",
1176
    "correlation function $C_{ij}$ for the ground state of the Heisenberg\n",
1177
    "model. However, the best choice of $\\lambda$ differed substantially\n",
1178
    "across the different $C_{ij}$ for both kernels. We present the predicted\n",
1179
    "correlation matrix $C^{\\prime}$ for randomly selected Heisenberg models\n",
1180
    "from the test set below for comparison against the actual correlation\n",
1181
    "matrix $C$, which is obtained from the ground state found using exact\n",
1182
    "diagonalization.\n"
1183
   ]
1184
  },
1185
  {
1186
   "cell_type": "code",
1187
   "execution_count": 64,
1188
   "metadata": {
1189
    "colab": {
1190
     "base_uri": "https://localhost:8080/",
1191
     "height": 1000
1192
    },
1193
    "id": "EpnjzM4WlMUK",
1194
    "outputId": "54130309-f89b-4dcd-97fa-626b40d1a169"
1195
   },
1196
   "outputs": [
1197
    {
1198
     "data": {
1199
      "image/png": "\n",
1200
      "text/plain": [
1201
       "<Figure size 1400x1400 with 10 Axes>"
1202
      ]
1203
     },
1204
     "metadata": {},
1205
     "output_type": "display_data"
1206
    }
1207
   ],
1208
   "source": [
1209
    "fig, axes = plt.subplots(3, 3, figsize=(14, 14))\n",
1210
    "corr_vals = [y_predclean, y_predicts1, y_predicts2]\n",
1211
    "plt_plots = [1, 14, 25]\n",
1212
    "\n",
1213
    "cols = [\n",
1214
    "    \"From {}\".format(col)\n",
1215
    "    for col in [\"Exact Diagonalization\", \"Gaussian Kernel\", \"Neur. Tang. Kernel\"]\n",
1216
    "]\n",
1217
    "rows = [\"Model {}\".format(row) for row in plt_plots]\n",
1218
    "\n",
1219
    "for ax, col in zip(axes[0], cols):\n",
1220
    "    ax.set_title(col, fontsize=18)\n",
1221
    "\n",
1222
    "for ax, row in zip(axes[:, 0], rows):\n",
1223
    "    ax.set_ylabel(row, rotation=90, fontsize=24)\n",
1224
    "\n",
1225
    "for itr in range(3):\n",
1226
    "    for idx, corr_val in enumerate(corr_vals):\n",
1227
    "        shw = axes[itr][idx].imshow(\n",
1228
    "            np.array(corr_vals[idx]).T[plt_plots[itr]].reshape(Nr * Nc, Nr * Nc),\n",
1229
    "            cmap=plt.get_cmap(\"RdBu\"), vmin=-1, vmax=1,\n",
1230
    "        )\n",
1231
    "        axes[itr][idx].xaxis.set_ticks(range(Nr * Nc))\n",
1232
    "        axes[itr][idx].yaxis.set_ticks(range(Nr * Nc))\n",
1233
    "        axes[itr][idx].xaxis.set_tick_params(labelsize=18)\n",
1234
    "        axes[itr][idx].yaxis.set_tick_params(labelsize=18)\n",
1235
    "\n",
1236
    "fig.subplots_adjust(right=0.86)\n",
1237
    "cbar_ax = fig.add_axes([0.90, 0.15, 0.015, 0.71])\n",
1238
    "bar = fig.colorbar(shw, cax=cbar_ax)\n",
1239
    "\n",
1240
    "bar.set_label(r\"$C_{ij}$\", fontsize=18, rotation=0)\n",
1241
    "bar.ax.tick_params(labelsize=16)\n",
1242
    "plt.show()"
1243
   ]
1244
  },
1245
  {
1246
   "cell_type": "markdown",
1247
   "metadata": {
1248
    "id": "EY4jZ8bPlMUK"
1249
   },
1250
   "source": [
1251
    "Finally, we also attempt to showcase the effect of the size of training\n",
1252
    "data $N$ and the number of Pauli measurements $T$. For this, we look at\n",
1253
    "the average root-mean-square error (RMSE) in prediction for each kernel\n",
1254
    "over all two-point correlation functions $C_{ij}$. Here, the first plot\n",
1255
    "looks at the different training sizes $N$ with a fixed number of\n",
1256
    "randomized Pauli measurements $T=100$. In contrast, the second plot\n",
1257
    "looks at the different shadow sizes $T$ with a fixed training data size\n",
1258
    "$N=70$. The performance improvement seems to be saturating after a\n",
    "sufficient increase in $N$ and $T$ values for both kernels in both\n",
    "cases.\n"
1261
   ]
1262
  },
1263
  {
1264
   "cell_type": "markdown",
1265
   "metadata": {
1266
    "id": "E2XL6tE-lMUK"
1267
   },
1268
   "source": [
1269
    "![image](/demonstrations/ml_classical_shadows/rmse_training.png){width=\"47.0%\"}\n",
1270
    "\n",
1271
    "![image](/demonstrations/ml_classical_shadows/rmse_shadow.png){width=\"47.0%\"}\n"
1272
   ]
1273
  },
1274
  {
1275
   "cell_type": "markdown",
1276
   "metadata": {
1277
    "id": "NGuMg3dVlMUK"
1278
   },
1279
   "source": [
1280
    "Conclusion\n",
1281
    "==========\n",
1282
    "\n",
1283
    "This demo illustrates how classical machine learning models can benefit\n",
1284
    "from the classical shadow formalism for learning characteristics and\n",
1285
    "predicting the behavior of quantum systems. As argued in Ref., this\n",
1286
    "raises the possibility that models trained on experimental or quantum\n",
    "data can effectively address quantum many-body problems that cannot\n",
1288
    "be solved using classical methods alone.\n"
1289
   ]
1290
  },
1291
  {
1292
   "cell_type": "code",
1293
   "execution_count": 65,
1294
   "metadata": {},
1295
   "outputs": [
1296
    {
1297
     "name": "stdout",
1298
     "output_type": "stream",
1299
     "text": [
1300
      "Time in seconds since end of run: 1693283266.7257466\n",
1301
      "Tue Aug 29 04:27:46 2023\n"
1302
     ]
1303
    }
1304
   ],
1305
   "source": [
1306
    "seconds = time.time()\n",
1307
    "print(\"Time in seconds since end of run:\", seconds)\n",
1308
    "local_time = time.ctime(seconds)\n",
1309
    "print(local_time)"
1310
   ]
1311
  },
1312
  {
1313
   "cell_type": "markdown",
1314
   "metadata": {
1315
    "id": "hkYS-zIJlMUL"
1316
   },
1317
   "source": [
1318
    "References {#ml_classical_shadow_references}\n",
1319
    "==========\n",
1320
    "\n",
1321
    "About the author\n",
1322
    "================\n"
1323
   ]
1324
  }
1325
 ],
1326
 "metadata": {
1327
  "colab": {
1328
   "provenance": []
1329
  },
1330
  "kernelspec": {
1331
   "display_name": "Python 3 (ipykernel)",
1332
   "language": "python",
1333
   "name": "python3"
1334
  },
1335
  "language_info": {
1336
   "codemirror_mode": {
1337
    "name": "ipython",
1338
    "version": 3
1339
   },
1340
   "file_extension": ".py",
1341
   "mimetype": "text/x-python",
1342
   "name": "python",
1343
   "nbconvert_exporter": "python",
1344
   "pygments_lexer": "ipython3",
1345
   "version": "3.10.8"
1346
  },
1347
  "widgets": {
1348
   "application/vnd.jupyter.widget-state+json": {
1349
    "state": {},
1350
    "version_major": 2,
1351
    "version_minor": 0
1352
   }
1353
  }
1354
 },
1355
 "nbformat": 4,
1356
 "nbformat_minor": 4
1357
}