How To Implement Functional List In Scala

Photo by drmakete lab

Functional programming has been getting more and more popular these days. A critical distinction about functional programming from imperative is that you cannot update variables or modify mutable data structures. There can be a lot of benefits on having all data immutable.

However, with great benefits, it also comes with some cost. For instance, it is a higher learning curve to adapt this paradigm into your team, and also another paradigm shift in how you define your data structure.

Many questions arise when we need to define a functional data structure - how do we define data structure and operate on them? How do we make every data structure pure?

In this article, I like to share how you can define a widely used data structure in scala, List, which is equivalent to a singly-LinkedList in Java. We start with the thought process on how you can think about data structure as immutable and proceed to define next function, which chains one node to the next in List.


The functional data structure operates on using pure function - it cannot change the data in place or do any side effects. Therefore, creating a functional list for an empty list (Nil) is the same for its immutability as an integer 3 or 4.When we do an operation on an integer, such as 3+4, we get a new integer 7. We didn’t change the value of integer 3 or integer 4 because it is immutable.

The functional List is the same.

When we concatenate 2 list together, List(1) ++ List(2), we will return a new List of List(1,2) without mutating List(1) or List(2).

Does that mean there is much deep copy from the previous two operations to the new ones?

Surprisingly, it doesn’t.

Since every single element is immutable, we can reuse it. When we concatenate two lists together, we return a new list by having one list pointing to the other. It is called data sharing. By doing so, we can implement functions much more efficiently by not having to worry about following code modifying our data.

Usually, the data structure is recursive - they have a base case, when the data structure is empty, such as Nil for empty List, Leaf for trees. Then, they have another type that represents the data structure that consists of value. In the case of List, it is Cons, which contains a single member variable and a function that connects the current Node to the next. In the case of Binary Tree, Branch is equivalent to the Node in the tree that has left and right that connects to the next Branch. We use this intuition to build our functional List.


How do we implement a List?

In Java, we create a class LinkedList and an inner class node that operates inside the LinkedList. The Node has a next members variable that points to the next Node.

We do the same when creating a functional List by creating the LinkedList, which is a MyList.

sealed trait MyList[+A]  {
    // create a right associative function
    def ::[B >: A](a:B):MyList[B]

We create top-level MyList that Empty List, Nil, and Cons will implements. The reason for creating sealed trait is to make our List pattern matchable later with Cons and Nil. Another reason is to create a sum type ADT.

Let’s create Nil:

case object Nil extends MyList[Nothing] {
  override def ::[B >: Nothing](a: B): MyList[B] = Cons(a, Nil)

And Cons:

case class Cons[+A](head:A, tail:MyList[A]) extends MyList[A] {
  override def ::[B >: A](a: B): MyList[B] = Cons(a, head :: tail)

Notice that both Cons and Nil both extend MyList. The head in Cons is equivalent to the value, Node, in Java, and tail refer to the next in the Node in Java.

Another observation from the above code is that there is a recursive nature when we are defining ::. When we define 5 :: List(3), the Cons recursively call :: on the tail until it hits the base case Nil. Nil l create the Cons by attaching the parameter in :: to Cons(a,Nil).

The recursive nature in List appears the same in other functional data structures as well. The base type,Nil, calls the intermediary type, Cons, and constructs the base element from the intermediary type. The intermediary function recursively is calling itself, and let the base type take care of the base scenario. Both of the types extend a trait that consists of the data structure.

Let’s create MyList for data constructor of our new List:

object MyList{
  def apply[A](a:A*): MyList[A] = if(a.isEmpty) Nil else Cons(a.head, apply(a.tail: _*))

Then, we can try testing our List in the main function:

object Main extends App {
  val a = 5 :: Nil
  val b = 7 :: 6 :: a


Main Takeaway

  • The functional data structure uses data sharing instead of doing a deep copy each time you do the operation with them.
  • The data structure is recursive. There always be the base type and the intermediary type that connect the container to form a new structure.
  • Usually, the base type creates its own by implement the function from the intermediary type. In the case of List, Nil creates :: from Cons.

All the source code on this tutorial are here.

Like this Article?

Sign up for my newsletter to get notified for new articles!

Related Posts

5 Anti Pattern for Writing Code in a Functional Programming Language

No 1. Nested Asynchronous Function

Why Do Functional Programmers Prefer For-Comprehension Over Imperative Code Block

Short Answer - Readability

How to Turn Domain Model into DynamoDB AttributeValue

A brief introduction about Dynosaur

Functional Programming has made My Job Easier as a Software Engineer. Here's Why.

Type level system able to let me sleep well at night

5 Functional Programming Side Projects To Deepen Your Functional Programming Skills

Practice Makes Perfect