CL Improvement: NAMES AND PATHS
 

Common Lisp Improvements: NAMES AND PATHS

DISCLAIMER: The code associated to these documents is not completely tested and it is bound to contain errors and omissions. This documentation may contain errors and omissions as well.

Finally, the license under which the code is released is not settled yet, although the file COPYING contains a Berkeley-style license. For the time being you are advised to use the code at your own risk. No warranty whatsoever is provided, the author will not be held responsible for any effect generated by your use of the library, and you can put here the scarier extra disclaimer you can think of.

The following code on two different CL implementations running on the same platform (i.e. machine plus operating system - let's say a Mac OS X) gives two different answers.

    CL implementation A on Mac OS-X> (pathname-name (parse-namestring ".cshrc"))
    ".cshrc"
  

    CL implementation B on Mac OS-X> (pathname-name (parse-namestring ".cshrc"))
    NIL
  

This (and several other cases) is enough to drive CL programmers mad (at least the writer), and is enough to scare away the novice.

The source of all these problems is the underspecified "Pathnames" section of the CL ANSI Standard.

The code coming with these documents provides a portable and unequivocal implementation that gets rid of most of the "implementation dependent" present in PARSE-NAMESTRING and friends.

Specification Rationale

The NAMES-AND-PATHS package defines a facility to manipulate pathnames. The manipulation never assumes accessing a file system of any sort. In particular, issues like "how do I follow a symbolic link in Linux" are irrelevant in the scope of this specification.

The specification and the related code use notions that are available in several CL implementations. The implementation was developed in a "clean room" fashion, without referencing any particular code-base.

The code SHADOWs most of the standard CL pathname functions.

The following steps were necessary to produce the specification and its implementation.

  1. The first, and necessary, step that must be taken is to provide a usable definition of a host. Many implementations do have such a notion buried under layers of packages and history. Note that this change also implies that the notion of logical host must be refined. Moreover, the notion of run-time host must be defined as well.
  2. Secondly, the specification of PARSE-NAMESTRING must be amended, by taking into account the new host definitions and by defining a more standardized way to parse namestrings. This means introducing syntax.
  3. The meaning of the PATHNAME fields, especially the :HOST must then be clarified.
  4. The definition of NAMESTRING and friends, must now take into account the new underlying definitions.

As an additional feature, the new PARSE-NAMESTRING and friends are capable of parsing and returning URI object. "file:///..." URI strings are rendered as pathnames on the loca host.

Finally a set of ancillary utilities, mostly devoted to make the NAMES-AND-PATHS play ball with the underlying implementation was defined as well.

Non (Yet) Specified Behavior

Although the code does respect the ANSI specification and extends it or modifies it, it cannot be used as a "drop-in" replacement for the standard operations. Unfortunately there is no way to ensure that an implementation can be made to obey the conventions set up by the package. While the code can be used in production code, it requires some (minimal) wrappers that may make it a little cumbersome to use. All in all the code is provided essentially as a portable platform for experimentation, with the hope that implementors will accept its conventions and converge on a portable version of the pathname facilities of Common Lisp.

#P Reader Syntax

The #P reader syntax is too delicate to try to change it in any implementation. The NAMES-AND-PATHS package does not redefine the #P syntax. Therefore, entering a pathname in the system using it will result in an implementation defined pathname.

:CASE Not Handled

The :CASE issue is not completely dealt with at this point. It will solved in the future.

Wild Pathnames Not Handled

Wild pathnames beyond those having :WILD and :WILD-INFERIORS elements are not handled. In particular, there is no implementation of TRANSLATE-PATHNAMES. Of course, it will be a valuable addition to the library.

NAMES-AND-PATHS Dictionary

Reference Implementation Status

The reference implementation is still in an embryonic state. There are several unimplemented features and several design decisions to be made. I am asking for help.

Major design decisions to be made:

  • Do we SHADOW completely the PATHNAME class?
  • What default behavior for NIL and :UNSPECIFIC components should we choose? E.g. should UNIX pathnames always have an :UNSPECIFIC device?

Major unimplemented features:

  • NAMESTRING and friends are not completely implemented, pending a decision on the previous item. I.e. right now the implementation tries to "ride" the underlying pathname implementation, but, in doing so, it forgoes having a proper host to dispatch against in a deprecated "implementation dependent" way.

Dependencies

The current implementation depends on the CL-ENVIRONMENT package, which should be loaded separately. This dependency can be removed with some work. Although it gives a nice foundation to work on while dealing with different hosts.

The NAMES-AND-PATHS library reuses the wonderful URI parsing code by Franz Inc.

Test Suite

The NAMES-AND-PATHS library comes with its own test suite. It uses the tester package from Franz Inc.

References

The usual ones. Plus tons of past discussions on C.L.L. and elsewhere.

Site Map

Enjoy!


Questions? Queries? Suggestions? Comments? Please direct them at me.

 

News

News in chronological order, most recent on top.

  • 2005-08-10
    Started the documentation.