Add files via upload

2019-03-07 16:04:51 -08:00 · 2019-03-07 16:04:51 -08:00 · 171486979a
commit 171486979a
parent 15c89493be
1 changed files with 73 additions and 39 deletions
--- a/ipynb/Cryptarithmetic.ipynb
+++ b/ipynb/Cryptarithmetic.ipynb
@ -4,7 +4,7 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "<div align=\"right\">Peter Norvig 2014</i></div>\n",
+    "<div align=\"right\">Peter Norvig 2014</div>\n",
    "\n",
    "# Cryptarithmetic (Alphametic) Problems\n",
    "\n",
@ -25,10 +25,11 @@
    "6. Should I find one solution (faster) or all solutions (more complete)? I'll handle both use cases by \n",
    "implementing my function `solve` to return an iterator, which yields solutions one at a time; you can get the first one with `next` or all of them with `set`. \n",
    "\n",
-    "## The solution: `solve`\n",
+    "# The solution: `solve`\n",
    "\n",
    "Below we see that `solve` works by generating every way to replace letters in the formula with numbers,\n",
-    "and then filtering them to keep only valid strings (ones that evaluate to true and have no leading zero)."
+    "and then filtering them to keep only valid strings (ones that evaluate to true and have no leading zero).\n",
+    "The `str.translate` method is used to do the replacements."
   ]
  },
  {
@ -43,22 +44,25 @@
    "def solve(formula):\n",
    "    \"\"\"Given a formula like 'NUM + BER = PLAY', fill in digits to solve it.\n",
    "    Generate all valid digit-filled-in strings.\"\"\"\n",
-    "    return filter(valid, replace_letters(formula.replace(' = ', ' == ')))\n",
+    "    return filter(valid, letter_replacements(formula))\n",
    "\n",
-    "def replace_letters(formula):\n",
-    "    \"\"\"Generate all possible replacements of letters with digits in formula.\"\"\"\n",
-    "    letters = ''.join(set(re.findall('[A-Z]', formula)))\n",
+    "def letter_replacements(formula):\n",
+    "    \"\"\"All possible replacements of letters with digits in formula.\"\"\"\n",
+    "    formula = formula.replace(' = ', ' == ') # Allow = or ==\n",
+    "    letters = cat(set(re.findall('[A-Z]', formula)))\n",
    "    for digits in itertools.permutations('1234567890', len(letters)):\n",
-    "        yield formula.translate(str.maketrans(letters, ''.join(digits)))\n",
+    "        yield formula.translate(str.maketrans(letters, cat(digits)))\n",
    "\n",
    "def valid(exp):\n",
    "    \"\"\"Expression is valid iff it has no leading zero, and evaluates to true.\"\"\"\n",
    "    try:\n",
-    "        return not leading_zero(exp) and eval(exp)\n",
+    "        return not leading_zero(exp) and eval(exp) is True\n",
    "    except ArithmeticError:\n",
    "        return False\n",
    "    \n",
-    "leading_zero = re.compile(r'\\b0[0-9]').search"
+    "cat = ''.join # Function to concatenate strings\n",
+    "    \n",
+    "leading_zero = re.compile(r'\\b0[0-9]').search # Function to check for illegal number"
   ]
  },
  {
@ -69,7 +73,7 @@
    {
     "data": {
      "text/plain": [
-       "'746 + 289 == 1035'"
+       "'489 + 537 == 1026'"
      ]
     },
     "execution_count": 2,
@ -90,8 +94,8 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "CPU times: user 17 s, sys: 68.2 ms, total: 17.1 s\n",
-      "Wall time: 17.2 s\n"
+      "CPU times: user 17.2 s, sys: 61.3 ms, total: 17.3 s\n",
+      "Wall time: 17.4 s\n"
     ]
    },
    {
@ -113,29 +117,52 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "So there are 96 solutions, but `solve` is a bit slow to find them all. How could we make `solve` faster?\n",
+    "# A faster solution: `faster_solve`\n",
    "\n",
-    "## A faster solution: `faster_solve`\n",
-    "\n",
-    "I used `%prun` to get profiling results, and saw that 2/3 of the time is spent in `eval`. So let's eliminate the calls to `eval`. That should be doable, because the expression we are evaluating is basically the same each time, but with different permutations of digits filled in.  We could save a lot of work if we convert the expression into a Python function, compile that function just once, and then call the function for each permutation of digits.  In other words, we want to take an expression such as\n",
+    "Depending on your computer, that probably took 15 or 20 seconds. Can we make it faster? To answer the question, I start by profiling to see where the time is spent. I can use the magic function `%prun` to profile:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      " "
+     ]
+    }
+   ],
+   "source": [
+    "%prun next(solve('NUM + BER = PLAY'))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "We see that about 2/3 of the time is spent in `eval`. So let's eliminate the calls to `eval`. That should be doable, because the expression we are evaluating is basically the same each time, but with different permutations of digits filled in.  We could save a lot of work if we convert the expression into a Python function, compile that function just once, and then call the function for each of the 3.6 million permutations of digits.  We want to take an expression such as:\n",
    "\n",
    "    \"NUM + BER == PLAY\"\n",
    "    \n",
-    "and transform it into the Python function\n",
+    "and transform it into the Python function:\n",
    "\n",
-    "\n",
-    "    lambda A,B,E,L,M,N,P,R,U,Y: (100*N+10*U+M) + (100*B+10*E+R) == (1000*P+100*L+10*A+Y)\n",
+    "    (lambda A,B,E,L,M,N,P,R,U,Y: \n",
+    "      (100*N+10*U+M) + (100*B+10*E+R) == (1000*P+100*L+10*A+Y))\n",
    "    \n",
    "Actually that's not quite right.  The rules say that \"N\", \"B\", and \"P\"  cannot be zero. So the function should be:\n",
    "\n",
-    "    A,B,E,L,M,N,P,R,U,Y: B and N and P and ((100*N+10*U+M) + (100*B+10*E+R) == (1000*P+100*L+10*A+Y))\n",
+    "    (lambda A,B,E,L,M,N,P,R,U,Y: \n",
+    "      B and N and P and ((100*N+10*U+M) + (100*B+10*E+R) == (1000*P+100*L+10*A+Y)))\n",
    "\n",
    "Here is the code to compile a formula into a Python function: "
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": 5,
   "metadata": {},
   "outputs": [],
   "source": [
@ -144,7 +171,7 @@
    "    in same order as parms of function. For example, 'YOU == ME**2' returns\n",
    "    (lambda E,M,O,U,Y: M and Y and ((100*Y+10*O+U) == (10*M+E)**2), 'YMEUO'\"\"\"\n",
    "    formula      = formula.replace(' = ', ' == ')\n",
-    "    letters      = ''.join(sorted(set(re.findall('[A-Z]', formula))))\n",
+    "    letters      = cat(sorted(set(re.findall('[A-Z]', formula))))\n",
    "    firstletters = sorted(set(re.findall(r'\\b([A-Z])[A-Z]', formula)))\n",
    "    body         = re.sub('[A-Z]+', compile_word, formula)\n",
    "    body         = ' and '.join(firstletters + [body])\n",
@ -164,7 +191,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 6,
   "metadata": {},
   "outputs": [
    {
@ -180,7 +207,7 @@
       "(<function __main__.<lambda>(E, M, O, U, Y)>, 'EMOUY')"
      ]
     },
-     "execution_count": 5,
+     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
@ -191,7 +218,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 6,
+   "execution_count": 7,
   "metadata": {},
   "outputs": [
    {
@ -207,7 +234,7 @@
       "(<function __main__.<lambda>(A, B, E, L, M, N, P, R, U, Y)>, 'ABELMNPRUY')"
      ]
     },
-     "execution_count": 6,
+     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
@ -216,9 +243,16 @@
    "compile_formula(\"NUM + BER = PLAY\", verbose=True)"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Now we're ready for the faster version of `solve`:"
+   ]
+  },
  {
   "cell_type": "code",
-   "execution_count": 7,
+   "execution_count": 8,
   "metadata": {},
   "outputs": [],
   "source": [
@ -229,14 +263,14 @@
    "    for digits in itertools.permutations((1,2,3,4,5,6,7,8,9,0), len(letters)):\n",
    "        try:\n",
    "            if fn(*digits):\n",
-    "                yield formula.translate(str.maketrans(letters, ''.join(map(str, digits))))\n",
+    "                yield formula.translate(str.maketrans(letters, cat(map(str, digits))))\n",
    "        except ArithmeticError: \n",
    "            pass"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": 9,
   "metadata": {},
   "outputs": [
    {
@ -245,7 +279,7 @@
       "'587 + 439 = 1026'"
      ]
     },
-     "execution_count": 8,
+     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
@ -256,15 +290,15 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 9,
+   "execution_count": 10,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "CPU times: user 1.3 s, sys: 3.96 ms, total: 1.3 s\n",
-      "Wall time: 1.3 s\n"
+      "CPU times: user 1.32 s, sys: 8.41 ms, total: 1.33 s\n",
+      "Wall time: 1.33 s\n"
     ]
    },
    {
@ -273,7 +307,7 @@
       "96"
      ]
     },
-     "execution_count": 9,
+     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
@ -298,7 +332,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": 11,
   "metadata": {},
   "outputs": [
    {
@ -372,8 +406,8 @@
      "IN + ARCTIC + TERRAIN + AN + ANCIENT + EERIE + ICE + TRACT + I + ENTER + A + TRANCE = FLATIANA| 42 + 379549 + 5877342 + 32 + 3294825 + 88748 + 498 + 57395 + 4 + 82587 + 3 + 573298 = 10354323\n",
      "ONE < TWO < THREE < SEVEN - THREE < THREE + TWO < THREE + THREE < SEVEN < SEVEN + ONE < THREE * THREE| 321 < 483 < 45711 < 91612 - 45711 < 45711 + 483 < 45711 + 45711 < 91612 < 91612 + 321 < 45711 * 45711\n",
      "AN + ACCELERATING + INFERENTIAL + ENGINEERING + TALE + ELITE + GRANT + FEE + ET + CETERA = ARTIFICIAL + INTELLIGENCE| 59 + 577404251698 + 69342491650 + 49869442698 + 1504 + 40614 + 82591 + 344 + 41 + 741425 = 5216367650 + 691400684974\n",
-      "CPU times: user 49.7 s, sys: 216 ms, total: 50 s\n",
-      "Wall time: 50.2 s\n"
+      "CPU times: user 51.2 s, sys: 246 ms, total: 51.4 s\n",
+      "Wall time: 51.7 s\n"
     ]
    },
    {
@ -382,7 +416,7 @@
       "67"
      ]
     },
-     "execution_count": 10,
+     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }