Modules

TODO: This proposal is incomplete and in-progress. Any of this may change and we have not committing to shipping anything.

Modules aggregate Dart libraries into larger collections that can be compiled separately. A modular compiler can compile each module independently and the declarative module structure enables a compiler to know which Dart source changes require which modules to be rebuilt.

Modules also intend to make large-scale code maintainance and reuse clearer and more robust.

A module specifies:

A set of Dart libraries that it contains.
Its dependencies, the set of other modules that it depends on.
Whether or not the module is public or private to the package it belongs to.

There are a couple of restrictions:

A module may only contain libraries from the same package.
A library may only import (or export or part) libraries that are either in its own module, or in modules that the library's module directly depends on.
Module dependencies may not have cycles. The module dependency graph is a DAG. (However, within a module, libraries may freely import each other, including cycles.)
A module cannot depend on a private module from another package.

The first two rules mean that there is a strict nesting: packages contain modules which contain libraries (which may contain parts).

In return for following these rules, a change to a library in module A will never affect the static analysis, compilation, or runtime behavior of a library in module B unless A is a dependency of B (directly or indirectly). A "downstream" change to some code never requires anything "upstream" of it to be recompiled or reanalyzed. Given a set of modules, a tool or compiler can process them separately in topological order and will never need to reprocess a module.

Motivation

TODO

Authoring modules

Packages must declare all their dependencies in the package's pubspec. Libraries must declare all of their imports at the top of the file. It would be a real pain if users also had to explicitly author the boundaries and dependencies of every single module too.

To avoid that, module boundaries and dependencies are inferred automatically when possible. By default every Dart library is its own module. Creating a library implicitly creates a new module for it. This defaults modules that are fairly fine-grained and whose scope aligns with the construct users are already familiar with.

Module names

Every module has a name, which is a dotted identifier. The name can be whatever you want. It is local to the package and does not have to be globally unique. It just needs to be different from the names of other modules in the same package.

By default, the implicit module for each library is named based on the library's path from the package root and the library's base name. So a library at lib/src/set.dart implicitly goes into a module named lib.src.set. The library in test/set_test.dart implicitly creates a module named test.set_test.

TODO: Is this the right rule? What about dots in file names?

Module boundaries

A library can provide an explicit module name using a library directive with an in clause like:

library in some.module.name;

All libraries in the same package with the same module name are grouped into the same module.

Module dependencies

A Dart compiler needs to know which modules depend on which others to know what order to process or compile them. These dependencies are inferred from the imports (and exports) of the libraries in the module. A module M depends on all of the modules that contain libraries that the libraries in M import or export.

Since dependencies are inferred, error messages from prohibited cyclic dependencies among module could be confusing. We suggest that tools mention the specific libraries whose imports cause the module dependency when explaining the error.

Privacy

Identifiers starting with _ are private in Dart. Declarations named with a leading underscore cannot be accessed outside of the library where it appears. Semantically, private names behave as if the leading underscore is replaced with a unique mangled name based on the library where the name appears.

Instead of using libraries as the privacy boundary, we extend it to the module. All libraries within a module can access private declarations from any other library in the same module: they can call private top-level functions, construct private classes, override private methods, etc. It is as if the _ is mangled based on the module's name instead of the library's.

This does not mean that libraries in the same module share the same top-level namespace. A private declaration in library A is not accessible to library B in the same module unless B explicitly imports A. Each library still controls its own namespace. It's just that if a library imports another in the same module, it can then see private names from that other library.

It is already an established pattern in Dart to locate multiple class declarations in the same library so that they can share access to private state and behavior. This lets users extend that pattern across multiple files without having to use part files.

Code generation

This should also address many of the limitations of using part files for generated code. Code generation often uses parts so that the main library can access private declarations in the generated library (or vice versa). But this means the generated part file cannot have its own imports and those have to be hand-authored in the main library. With this, the generated code could be in a separate library but in the same module as the hand-authored library. It can then access private members in the main library but contain its own imports.

Friend modules

Clear box testing refers to unit tests that validate not just the external public API of a class or library, but its private state and implementation as well. The Dart language currently doesn't have good support for this. Since tests are separate libraries from the code under test, any API being tested must be visible so the test can see it. But that makes the API visible to all external users of the library as well.

Our analysis tools provide some support for clear box testing through the @visibleForTesting annotation. This can be placed on a public declaration and users will get a static warning if the declaration is used anywhere but tests. But this is only a tooling-level feature. The language itself doesn't enforce this.

We can provide easier support by allowing a test module to directly access private declarations. To enable that and other patterns, we allow a module to declare itself a friend to another module.

Any one library in the module can add a library directive with a friend clause indicating the name of the module this module friends:

library friend some.other.module;

Private identifiers in friend modules are visible to all libraries in both modules. Friendship is transitive. If module A is a friend of B which is a friend of C, then A, B, and C, all have access to each other's private identifiers.

Allowing any module to unilaterally declare itself a friend of another could break the encapsulation of package APIs and make the ecosystem fragile. To avoid that, a module can only declare itself a friend of another module in the same package. This lets test modules declare themselves friends of library modules, but prohibits breaking package encapsulation.

It is a compile-time error if multiple libraries in the same module have friend clauses. A module can only friend one other module. (But a module can be a friend of multiple other modules. Friendship forms a tree where all modules in the tree share the same private names.)

Library directive syntax

Taking the above into account, the grammar for the library directive is:

libraryName ::= metadata 'library'
    dottedIdentifierList?
    ( 'in' dottedIdentifierList )?
    ( 'friend' dottedIdentifierList )? ';'

Capability controls on types

There are three fundamental kinds of entities in Dart's semantics:

A class has a set of member declarations and a superclass (which may be Object). You can use a class construct new instances (if not abstract) and/or you can extend one as a superclass.
An interface is a set of member signatures.
A mixin is a set of member declarations. Unlike a class, a mixin does not have a superclass. You have to apply the mixin to some concrete superclass in order to get a class that you can construct.

Dart's syntax somewhat obscures this. There is no dedicated syntax for declaring an interface. Until recently there was also no syntax for declaring a mixin. Instead, a class declaration can, unless prohibited by its own structure, be used as an interface, superclass, or mixin.

Inferring interfaces from classes and mixins a useful tool to avoid the redundancy found in Java and C# code. It provides consumers of the class maximum flexibility. But it comes at a cost:

Since a class may be used as an interface, adding a method is potentially a breaking change, even if the author never intended the class's interface to be implemented. In practice, many class maintainers document how the class should be used and don't consider it a breaking change (and thus don't change the package's major version) if they change a class in a way would break users not following that documentation.
When a class implicitly permits anything, it can be hard to tell how it is intended to be used. Restricting the options can provide a simpler, more guided API.
Changes to a class can break one of its capabilities. If you change a generative constructor to a factory constructor, that will break any subclasses that chained to that constructor. Since the language doesn't know whether or not you intend that class to be subclassed in other packages, it can't alert you to the consequences of that change.
In order to get exhaustiveness checking on pattern matching, we need some notion of a sealed family of types. Otherwise, there's no way to tell if a switch case has covered all types.

Because of these, users ask for control over the affordances a declaration provides (349, 704, 987). Modules are a natural boundary for those restrictions.

Note that the above problems only come into play when unknown code works with a type. Thus these restrictions only apply to code using a type outside of the module where the type is declared. Inside the types's own module, you are free to use types however you want. It's your code.

Types of types and capabilities

Restating the above, there are four affordances a type might offer: construct, extend, implement, and mix in. An analysis of the class declarations in Google's corpus shows these combinations are most common:

Construct               63.93% 6605
(none)                  14.09% 1456
Implement                9.77% 1009
Extend                   6.47%  668
Construct + Implement    2.36%  244
Mixin                    1.25%  129

All other combinations are less than 1%. Every combination occurs in practice, though the few examples of combinations involving mixins and classes seem to be historical from the time before Dart's dedicated mixin syntax.

Mixins and classes

Combinations of classes and interfaces make sense. Likewise, it seems natural to derive an interface from a mixin. But deriving both a class and a mixin from the same declaration has proven to be confusing.

A mixin, by definition, has no superclass. In order to construct or extend something, it must be a full-formed class with an inheritance chain all the way up to Object. When you derive a mixin from a class today, Dart discards the superclass and any inherited methods.

This is a continuing source of confusion for users, which is one reason we added dedicated mixin syntax. Now that we have language versioning, we can complete that transition. With this proposal, a class declaration no longer defines an implicit mixin declaration. The only way to create a mixin is using mixin.

Syntax

We support all combinations of the above four capabilities, except for combinations with Mixin + Construct or Mixin + Extend.

Following Dart's existing syntax, we use class to define classes, mixin define mixins, and abstract to prevent constructing. Following Java and others, we use interface (as a modifier here) to allow implementing. Following Swift and Kotlin, we use open to allow subclassing.

The updated grammar is:

topLevelDeclaration ::=
    classDeclaration
  | mixinDeclaration
  // existing rules...

classDeclaration ::= 'open'? 'interface'? 'abstract'? 'class' identifier
  typeParameters? superclass? interfaces?
  '{' (metadata classMemberDeclaration)* '}'

mixinDeclaration ::= 'interface'? 'mixin' identifier typeParameters?
  ('on' typeNotVoidList)? interfaces?
  '{' (metadata classMemberDeclaration)* '}'

That yields these combinations:

class                         // 63.93% Construct
abstract class                // 14.09% (none)
interface abstract class      //  9.77% Implement
open abstract class           //  6.47% Extend
interface class               //  2.36% Implement Construct
mixin                         //  1.25% Mix-in
open class                    //  0.86% Extend Construct
open interface abstract class //  0.76% Implement Extend
open interface class          //  0.20% Implement Extend Construct
interface mixin               //  0.09% Mix-in Implement

Using abstract class to opt out of constructing is pretty verbose, especially when combined with other modifiers. We could conceivably drop it from classes that have interface:

interface abstract class      -> interface
open interface abstract class -> open interface

But it might be confusing that a type declared using only interface does in fact define a class that may have concrete methods and even be constructed and used as a class inside the module. Dart users today are already used to using abstract class to declare interfaces, so this isn't too much of a stretch. Also, this leaves interface available as potential future syntax for defining a pure interface, if a need for such thing should arrive.

Static semantics

Within a module, despite all the new modifiers, the semantics are roughly the same. The rules for using a type within its module are as permissive as possible and are based on the structure of the type itself, as in current Dart:

It is a compile-time error to invoke a generative constructor of a class if the class defines or inherits any unimplemented abstract members. You can directly construct abstract classes internally if it wouldn't cause a problem to do so. Mixins never have generative cosntructors.
It is a compile-time error to extend a class that has at least one factory constructor and no generative constructors. It is a compile time error to extend a mixin.
It is a compile-time error to mix in a class.

The rules for using types outside of their module are based on the capabilities the type explicitly permits:

It is a compile-time error to invoke a generative constructor of a class marked abstract outside of the module where the class is defined.
It is a compile-time error for a class to appear in an extends clause outside of the module where the class is defined unless the class is marked open.
It is a compile-time error for a type to appear in an implements clause outside of the module where the type is defined unless the type is marked interface.
It is a compile-time error for a class to appear in a with clause.

We also want to make sure the type structurally supports any capability it claims to offer. This helps package maintainers catch mistakes where they inadvertently break a capability that the type offers.

It is a compile-time error if a non-abstract class contains an abstract method or inherits an abstract method with no corresponding implementation. Since a non-abstract class declares that code outside the module can construct it, this rule ensures that it is safe to do so. Abstract classes and mixins may contain both abstract and non-abstract members.
It is a compile-time error if a public-named type marked class does not have a public-named constructor. The constructor can be a default or factory constructor, and can be unnamed.
It is a compile-time error if a public-named type marked interface has any private members. This is to avoid the problem where an external implementation of an interface may omit private members that the module then assumes it can call when given an instance of that interface.

TODO: Is this too much of a restriction?
It is a compile-time error if a class C marked interface has a superclass D which is not also marked interface, unless C and D are declared in the same module. In other words, someone can't extend a class with an interface that they don't control and then retroactively expose its interface by way of a subclass. This ensures that if you declare a class C with no interface, then any object of type C will reliably be an instance of your actual class C or some other type you control.

Capabilities on legacy classes

The above syntax means that it an error to implement, mixin, or extend a class declared just using class. This would break nearly all existing Dart code if it were retroactively applied to existing code.

Fortunately, we have language versioning to help. Dart libraries still at the language version before modules will behave as if all class declarations are implicitly marked with all of the capabilities the class can support. In particular:

All classes and mixins are treated as implicitly marked interface.
If the class has at least one generative constructor (which may be default), it is treated as implicitly marked open.
If the class has no non-default generative constructors, and Object as superclass, it continues to expose an implicit mixin.

When updating a library to the language version that supports modules, you'll want to decide what capabilities to offer, or just place all the modifiers you need to preserve the class's current behavior.

Migrating a class that is used both as a class and a mixin is harder. For that, you will have to migrate it to two separate declarations and give one of them a different name. Fortunately, classes used this way are very rare.

TODO: Investigate tooling to automatically migrate.

Capability controls on members

TODO: Do we want 'final' non-overridable members?

TODO: Protected?

Implicit modules and legacy code

TODO

Questions

Here are some design questions you might ask:

Why not infer module boundaries based on import cycles?

Our existing build systems automatically resolve import cycles by collecting all libraries in a cycle together and creating build targets for each strongly connected component. We could do something similar for modules. But module boundaries affect compile errors around class access restrictions and privacy. I think it would be surprising to users if adding or removing a single import spontaneously changed how private names get resolved or caused compile errors around invalid extends or implements clauses to appear or disappear.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature-specification.md

feature-specification.md

Modules

Motivation

Authoring modules

Module names

Module boundaries

Module dependencies

Privacy

Code generation

Friend modules

Library directive syntax

Capability controls on types

Types of types and capabilities

Mixins and classes

Syntax

Static semantics

Capabilities on legacy classes

Capability controls on members

Implicit modules and legacy code

Questions

Why not infer module boundaries based on import cycles?

Files

feature-specification.md

Latest commit

History

feature-specification.md

File metadata and controls

Modules

Motivation

Authoring modules

Module names

Module boundaries

Module dependencies

Privacy

Code generation

Friend modules

Library directive syntax

Capability controls on types

Types of types and capabilities

Mixins and classes

Syntax

Static semantics

Capabilities on legacy classes

Capability controls on members

Implicit modules and legacy code

Questions

Why not infer module boundaries based on import cycles?