Design and Implementation a simple interpreter NUMEX programming language with Racket.
Definition of NUMEX's syntax:
- If π is a Racket string, then (var π ) is a NUMEX expression (variables).
- If π is a Racket integer, then (num π) is a NUMEX expression (number constants).
- If π is a Racket boolean, then (bool π) is a NUMEX expression (boolean constants).
- If π1 and π2 are NUMEX expressions, then (plus π1 π2) is a NUMEX expression (addition).
- If π1 and π2 are NUMEX expressions, then (minus π1 π2) is a NUMEX expression (subtraction).
- If π1 and π2 are NUMEX expressions, then (mult π1 π2) is a NUMEX expression (multiplication).
- If π1 and π2 are NUMEX expressions, then (div π1 π2) is a NUMEX expression (division).
- If π1 is a NUMEX expression, then (neg π1) is a NUMEX expression (negation).
- If π1 and π2 are NUMEX expressions, then (andalso π1 π2) is a NUMEX expression (logical conjunction).
- If π1 and π2 are NUMEX expressions, then (orelse π1 π2) is a NUMEX expression (logical disjunction).
- If π1, π2, and π3 are NUMEX expressions, then (cnd π1 π2 π3) is a NUMEX expression. It is a condition where the result is π2 if π1 is true, else the result is π3. Only one of π2 and π3 is evaluated.
- If π1 and π2 are NUMEX expressions, then (iseq π1 π2) is a NUMEX expression. (comparison).
- If π1, π2, and π3 are NUMEX expressions, then (ifnzero π1 π2 π3) is a NUMEX expression. It is a condition where the result is π2 if π1 is not zero, else the result is π3. Only one of π2 and π3 is evaluated.
- If π1, π2, π3, and π4 are NUMEX expressions, then (ifleq π1 π2 π3 π4) is a NUMEX expression. It is a conditional where the result is π4 if π1 is strictly greater than π2, else the result is π3. Only one of π3 and π4 is evaluated.
- If π 1 and π 2 are Racket strings and π is a NUMEX expression, then (lam π 1 π 2 π) is a NUMEX expression (a function). In π, π 1 is bound to the function itself (for recursion) and π 2 is bound to the only argument. Also, (lam null π 2 π) is allowed for anonymous nonrecursive functions.
- If π1 and π2 are NUMEX expressions, then (apply π1 π2) is a NUMEX expression (function application).
- If π is a Racket string, and π1 and π2 are NUMEX expressions, then (with π π1 π2) is a NUMEX expression (a let expression where the value of π1 is bound to π in π2).
- If π1 and π2 are NUMEX expressions, then (apair π1 π2) is a NUMEX expression (pair constructor).
- If π1 is a NUMEX expression, then (1st π1) is a NUMEX expression (the first part of a pair).
- If π1 is a NUMEX expression, then (2nd π1) is a NUMEX expression (the second part of a pair).
- (munit) is a NUMEX expression (holding no data, much like () in ML or null in Racket). Notice (munit) is a NUMEX expression, but munit is not.
- If π1 is a NUMEX expression, then (ismunit π1) is a NUMEX expression (testing for (munit)).
- (closure πππ£ π) is a NUMEX value where π is a NUMEX function and πππ£ is an environment that maps variables to values. Closures do not appear in programs; they result from evaluating functions.
- If π 1 is a Racket string and π 2 is a Racket string and π1 is a NUMEX expression and π2 is NUMEX expression and π3 is a NUMEX expression, then (letrec π 1 π1 π 2 π2 π3) is a NUMEX expression (a letrec expression for recursive definitions where the the value of π1 is bound to π 1 and the value of π2 is bound to π 2 in the π3).
- If s is a Racket string and e is a NUMEX expression, then (key s e) is a NUMEX expression (key contructor).
- If k is a NUMEX key and m is a NUMEX munit, then (record k m) is a NUMEX expression (record contructor).
- If k is a NUMEX key and r is a NUMEX record, then (record k r) is a NUMEX expression (record constructor).
- If s is a Racket string and r is a NUMEX record, then (value s r) is a NUMEX expression (value of string in record).
A NUMEX π£πππ’π is a NUMEX number constant, a NUMEX boolean constant, a NUMEX closure, a NUMEX munit, or a NUMEX pair of NUMEX values. Similar to Racket, we can build list values out of nested pair values that end with a NUMEX munit. Such a NUMEX value is called a NUMEX list. You should πππ‘ assume NUMEX programs are syntactically correct (e.g., things like (num "hi") or (num (num 37)) must be handled). And do πππ‘ assume NUMEX programs are free of type errors like (plus (munit) (num 7)), (1st (num 7)) or (div (bool #t) (num 2)).
Problems:
- Warm-Up
(a) Write a Racket function racketlist->numexlist that takes a Racket list, which may even be a list of NUMEX values, and produces a NUMEX list with the same elements in the same order.
(b) Write a Racket function numexlist->racketlist that takes a NUMEX list and produces a Racket list with the same elements in the same order.
- Implementing NUMEX
Write an interpreter for NUMEX. It should be a Racket function eval-exp that takes a NUMEX expression e and either returns the NUMEX value that e evaluates to under the empty environment or calls Racket's error if evaluation encounters a run-time NUMEX type error or unbound NUMEX variables.
An NUMEX expression is evaluated in an environment (for evaluating variables, as usual). In your interpreter, use a Racket list of Racket pairs to represent this environment (which is initially empty) so that you can use the envlookup function, after completing it. Here is a description of the semantics of NUMEX expressions:
- All values (including closures) evaluate to themselves. For example, (eval-exp (num 17)) would return (num 17), not 17.
- A variable evaluates to the value associated with it in the given environment.
- An arithmetic operation (addition, subtraction, multiplication, and division) evaluates to the result of what its operands evaluate to. Note: the operands must be numbers.
- A logical operation (andalso and orelse) evaluates to the result of what its operands evaluate to. Note: short-circuit evaluations are desired, and the operands must be booleans.
- A negation (neg π) evaluates to the opposite (negation) of what π evaluates to. Note: π can be a number or a boolean.
- For (cnd π1 π2 π3), the expression π1 first evaluates to a boolean value. If the resulting value is (bool #t), the whole expression evaluates to what π2 evaluates to. The expression evaluates to the value of π3 otherwise.
- The evaluation of (iseq π1 π2) involves the evaluation of π1 and π2. The resulting values is (bool #t) if the value of π1 equals to the value of π2. Otherwise, the expression evaluates to (bool #f). Note: π1 and π2 can be numbers/booleans.
- For (ifnzero π1 π2 π3), the expression π1 first evaluates to a value. If the resulting value is not zero, the whole expression evaluates to what π2 evaluates to. The expression evaluates to the value of π3 otherwise.
- The evaluation of (ifleq π1 π2 π3 π4) involves the evaluation of π1 and π2. If the value of π1 is strictly greater than the value of π3, then it evaluates to the value of π4. Otherwise, π3 must be evaluated.
- For (with π π1 π2), the expression π2 evaluates to a value in an environment extended to map the name π to the evaluated value of π1.
- Functions are lexically scoped in the sense that a function evaluates to a closure holding the function and the current environment.
- For (apply π1 π2), the expression π1 first evaluates to a value. If the resulting value is not a closure, an error should be arisen. Otherwise, it evaluates the closure's function's body in the closure's environment extended to map the function's name to the closure (unless the name field is null) and the function's argument to the result of the evaluation of π2.
- The (apair π1 π2) construct makes a (new) pair holding the results of the evaluations of π1 and π2.
- If the result of evaluating π1 in (1st π1) is an apair, then the first part is returned. Otherwise, it returns an error. Similarly, the evaluation of (2nd π1) is the second part of the given pair.
- For (ismunit π1), the expression π1 first evaluates to a value. If the resulting value is a munit expression, then the result is the NUMEX value (bool #t), else it is the NUMEX value (bool #f).
- For (letrec π 1 π1 π 2 π2 π3) the expression π3 evaluates to a value in an environment extended to map the name π 1 to the evaluated value of π1 and the name π 2 to the evaluated value of π2.
- If s is a Racket string and the result of evaluating e is a NUMEX expression, the (key s e) construct makes a key holding corresponding value of s which is e. Otherwise, it returns an error.
- If the result of evaluating k is a key and the result of evaluating m is a munit, the (record k m) construct makes a record holding the result of the evaluation of k and the result of the evaluation of m. Otherwise, it returns an error.
- If the result of evaluating k is a key and the result of evaluating r is a record, the (record k r) construct makes a record holding the results of the evaluation of k and the evaluation of r. Otherwise, it returns an error.
- If the result of evaluating s is a string and the result of evaluating r is a record, the (value s r) returns corresponding value of s in r. If s does not exist in r, the (value s r) returns (munit). Otherwise, it returns an error.
- Extending the Language
NUMEX is a small language, but we can write Racket functions that act like NUMEX macros so that users of these functions feel like NUMEX is larger. The Racket functions produce NUMEX expressions that could then be put inside larger NUMEX expressions or passed to evalexp. In implementing these Racket functions, do not use closure (which is used only internally in eval-exp). Also do not use eval-exp (we are creating a program, not running it).
(a) Write a Racket function ifmunit that takes three NUMEX expressions π1, π2, and π3. It returns a NUMEX expression that first evaluates π1. If the resulting value is NUMEX's munit, then it evaluates π2 and that is the overall result. Otherwise, π3 must be evaluated.
(b) Write a Racket function withβ that takes a Racket list of Racket pairs β((π 1 . π1) β¦ (π π . ππ) β¦ (π π . ππ)) and a final NUMEX expression ππ+1. In each pair, assume π π is a Racket string and ππ is a NUMEX expression. withβ returns a NUMEX expression whose value is ππ+1 evaluated in an environment where each π π is a variable bound to the result of evaluating the corresponding ππ for 1 β€ π β€ π. The bindings are done sequentially, so that each ππ is evaluated in an environment where π 1 through π πβ1 have been previously bound to the values π1 through ππβ1.
(c) Write a Racket function ifneq that takes four NUMEX expressions π1, π2, π3, and π4 and returns a NUMEX expression that acts like ifleq except π3 is evaluated if and only if π1 and π2 are not equal numbers/booleans. Otherwise, the whole expression evaluates to what π4 evaluates to. Assume none of the arguments to ifneq use the NUMEX variables _x or _y. Use this assumption so that when an expression returned from ifneq is evaluated, π1 and π2 are evaluated exactly once each.
- Using the Language
We can write NUMEX expressions directly in Racket using the constructors for the structs and (for convenience) the functions we wrote in the previous problem.
(a) Bind to the Racket variable numex-filter a NUMEX function that acts like map (as we use in ML). Your function should be curried: it should take a NUMEX function and return a NUMEX function that takes a NUMEX list and applies the function to every element of the list returning a new NUMEX list with all the elements for which the function returns a number other than zero (causing an error if the function returns a non-number). Recall a NUMEX list is munit or a pair where the second component is a NUMEX list.
(b) Bind to the Racket variable numex-all-gt a NUMEX function that takes a NUMEX number π and returns a NUMEX function that takes a NUMEX list of NUMEX numbers and returns a new NUMEX list of NUMEX numbes containing the elements of the input list (in order) that are greater than π (hint: Use numex-filter).
- Challenging Problem
Write a second version of eval-exp (bound to eval-exp-c) that builds closures with smaller environments: When building a closure, it uses an environment that is like the current environment but holds only variables that are free variables in the function part of the closure. (A free variable is a variable that appears in the function without being under some shadowing binding for the same variable.) Avoid computing a function's free variables more than once. Do this by writing a function compute-free-vars that takes an expression and returns a different expression that uses fun-challenge everywhere in place of fun. The new struct fun-challenge (provided to you; do not change it) has a field freevars to store exactly the set of free variables for the function. Store this set as a Racket set of Racket strings. (Sets are predefined in Racket's standard library; consult the documentation for useful functions such as set, set-add, set-member?, set-remove, set-union, and any other functions you wish.) You must have a top-level function compute-free-vars that works as just described β storing the free variables of each function in the freevars field β so the grader can test it directly. Then write a new βmain partβ of the interpreter that expects the sort of NUMEX expression that compute-free-vars returns. The case for function definitions is the interesting one.