(*<*)
theory "paper"
  imports "Isabelle_DOF.scholarly_paper"
begin

open_monitor*[this::article]

declare[[ strict_monitor_checking = false]]
declare[[ Definition_default_class = "definition"]]
declare[[ Lemma_default_class = "lemma"]]
declare[[ Theorem_default_class = "theorem"]]

define_shortcut* csp \<rightleftharpoons> \<open>CSP\<close>
                 holcsp \<rightleftharpoons> \<open>HOL-CSP\<close>
                 isabelle \<rightleftharpoons> \<open>Isabelle/HOL\<close>
(*>*)

title*[tit::title]\<open>Philosophers may Dine - Definitively!\<close>

author*[safouan,email="\<open>safouan.taha@lri.fr\<close>",affiliation="\<open>LRI, CentraleSupelec\<close>"]\<open>Safouan Taha\<close>
author*[bu,email= "\<open>wolff@lri.fr\<close>",affiliation = "\<open>LRI, Université Paris-Saclay\<close>"]\<open>Burkhart Wolff\<close>
author*[lina,email="\<open>lina.ye@lri.fr\<close>",affiliation="\<open>LRI, Inria, LSV, CentraleSupelec\<close>"]\<open>Lina Ye\<close>

abstract*[abs, keywordlist="[\<open>Shallow Embedding\<close>,\<open>Process-Algebra\<close>, \<open>Concurrency\<close>,\<open>Computational Models\<close>]"]
\<open> The theory of Communicating Sequential Processes going back to Hoare and Roscoe is still today
one of the reference theories for concurrent specification and computing. In 1997, a first
formalization in \<^isabelle> of the denotational semantics of the Failure/Divergence Model of
\<^csp> was undertaken; in particular, this model can cope with infinite alphabets, in contrast
to model-checking approaches limited to finite ones.
In this paper, we extend this theory to a significant degree by taking advantage of the more
powerful automation of modern Isabelle versions, bringing the theory even closer to recent
developments in the semantic foundation of \<^csp>.
More importantly, we use this formal development to analyse a family of refinement notions,
comprising classic and new ones. This analysis enabled us to derive a number of properties
that deepen the understanding of these notions, in particular with respect to
specification decomposition principles in the infinite case. Better definitions help to
clarify a number of obscure points in the classical literature, for example concerning the
relationship between deadlock- and livelock-freeness.
As a result, we have a modern environment for formal proofs of concurrent systems that allows
combining general infinite processes with locally finite ones in a logically safe way.
We demonstrate a number of resulting verification techniques for classical, generalized examples:
the CopyBuffer and Dijkstra's Dining Philosophers Problem of arbitrary size.

If you consider citing this paper, please refer to @{cite "HOL-CSP-iFM2020"}.
\<close>
text\<open>\<close>
section*[introheader::introduction,main_author="Some(@{docitem ''bu''}::author)"]\<open> Introduction \<close>
text*[introtext::introduction]\<open>
Communicating Sequential Processes (\<^csp>) is a language to specify and verify patterns of
interaction of concurrent systems. Together with CCS and LOTOS, it belongs to the family of
\<^emph>\<open>process algebras\<close>. \<^csp>'s rich theory comprises denotational, operational and algebraic semantic
facets and has influenced programming languages such as Limbo, Crystal, Clojure and most notably
Golang @{cite "donovan2015go"}. \<^csp> has been applied in industry as a tool for specifying and
verifying the concurrent aspects of hardware systems, such as the T9000 transputer
@{cite "Barret95"}.

The theory of \<^csp> was first described by Tony Hoare in 1978 and later presented in his book
@{cite "Hoare:1985:CSP:3921"}, but has since evolved substantially
@{cite "BrookesHR84" and "brookes-roscoe85" and "roscoe:csp:1998"}.
\<^csp> describes the most common communication and synchronization mechanisms with one single
language primitive: synchronous communication written \<open>_\<lbrakk>_\<rbrakk>_\<close>. \<^csp> semantics is described
by a fully abstract model of behaviour designed to be \<^emph>\<open>compositional\<close>: the denotational
semantics of a process \<open>P\<close> encompasses all possible behaviours of this process in the context of
all possible environments \<open>P \<lbrakk>S\<rbrakk> Env\<close> (where \<open>S\<close> is the set of \<open>atomic events\<close> on which both
\<open>P\<close> and \<open>Env\<close> must synchronize).
This design objective has the consequence that two kinds of choice have to be distinguished:
  \<^enum> the \<^emph>\<open>external choice\<close>, written \<open>_\<box>_\<close>, which forces a process "to follow" whatever
    the environment offers, and
  \<^enum> the \<^emph>\<open>internal choice\<close>, written \<open>_\<sqinter>_\<close>, which forces the environment of a process
    "to follow" the non-deterministic choices the process makes.
\<close>
text\<open>
Generalizations of these two operators \<open>\<box>x\<in>A. P(x)\<close> and \<open>\<Sqinter>x\<in>A. P(x)\<close> allow for modeling the
concepts of \<^emph>\<open>input\<close> and \<^emph>\<open>output\<close>: Based on the prefix operator \<open>a\<rightarrow>P\<close> (event \<open>a\<close> happens,
then the process proceeds with \<open>P\<close>), receiving input is modeled by \<open>\<box>x\<in>A. x\<rightarrow>P(x)\<close> while sending
output is represented by \<open>\<Sqinter>x\<in>A. x\<rightarrow>P(x)\<close>. Placing choice at the center of the language semantics
implies that deadlock-freeness becomes a vital property for the well-formedness of a process,
nearly as vital as type-checking: Consider two events \<open>a\<close> and \<open>b\<close> not involved in a process \<open>P\<close>,
then \<open>(a\<rightarrow>P \<box> b\<rightarrow>P) \<lbrakk>{a,b}\<rbrakk> (a\<rightarrow>P \<sqinter> b\<rightarrow>P)\<close> is deadlock free provided \<open>P\<close> is, while
\<open>(a\<rightarrow>P \<sqinter> b\<rightarrow>P) \<lbrakk>{a,b}\<rbrakk> (a\<rightarrow>P \<sqinter> b\<rightarrow>P)\<close> deadlocks (both processes can "ruthlessly" make
opposite choices, but are required to synchronize).

Verification of \<^csp> properties has been centered around the notion of \<^emph>\<open>process refinement
orderings\<close>, most notably \<open>_\<sqsubseteq>\<^sub>F\<^sub>D_\<close> and \<open>_\<sqsubseteq>_\<close>. The latter turns the denotational domain of \<^csp>
into a Scott cpo @{cite "scott:cpo:1972"}, which yields semantics for the fixed point operator
\<open>\<mu>x. f(x)\<close> provided that \<open>f\<close> is continuous with respect to \<open>_\<sqsubseteq>_\<close>. Since it is possible to
express deadlock-freeness and livelock-freeness as a refinement problem, the verification of
properties has traditionally been reduced to a model-checking problem for a finite set of
events \<open>A\<close>.

We are interested in verification techniques for arbitrary event sets \<open>A\<close> or arbitrarily
parameterized processes.
Such processes can be used to model dense-timed processes, processes with dynamic thread creation,
and processes with unbounded thread-local variables and buffers.
However, this adds substantial complexity to the process theory: when it comes to studying the
interplay of different denotational models, refinement-orderings, and side-conditions for
continuity, paper-and-pencil proofs easily reach their limits of precision.

Several attempts have been undertaken to develop a formal theory in an interactive proof system,
mostly in Isabelle/HOL @{cite "Camilleri91" and "tej.ea:corrected:1997" and "IsobeRoggenbach2010"
and "DBLP:journals/afp/Noce16"}.
This paper is based on @{cite "tej.ea:corrected:1997"}, which has been the most comprehensive
attempt to formalize denotational \<^csp> semantics covering a part of Bill Roscoe's book
@{cite "roscoe:csp:1998"}. Our contributions are as follows:
  \<^item> we ported @{cite "tej.ea:corrected:1997"} from Isabelle93-7 and ancient ML-written proof
    scripts to a modern Isabelle/HOL version and structured Isar proofs, and extended it
    substantially,
  \<^item> we introduced new refinement notions allowing a deeper understanding of the \<^csp>
    Failure/Divergence model, providing some meta-theoretic clarifications,
  \<^item> we used our framework to derive new types of decomposition rules and stronger induction
    principles based on the new refinement notions, and
  \<^item> we integrated this machinery into a number of advanced verification techniques, which we
    apply to two generalized paradigmatic examples in the \<^csp> literature, the CopyBuffer and
    Dining Philosophers@{footnote \<open>All proofs concerning the HOL-CSP 2 core have been published in
    the Archive of Formal Proofs @{cite "HOL-CSP-AFP"}; all other proofs are available at
    \<^url>\<open>https://gitlri.lri.fr/burkhart.wolff/hol-csp2.0\<close>. In this paper, all Isabelle proofs are
    omitted.\<close>}.
\<close>
(*
% Moreover, decomposition rules of the form:
% \begin{center}
% \begin{minipage}[c]{10cm}
%    @{cartouche [display] \<open>C \<Longrightarrow> A \<sqsubseteq>\<^sub>F\<^sub>D A' \<Longrightarrow> B \<sqsubseteq>\<^sub>F\<^sub>D B' \<Longrightarrow> A \<lbrakk>S\<rbrakk> B \<sqsubseteq>\<^sub>F\<^sub>D A' \<lbrakk>S\<rbrakk> B'\<close>}
% \end{minipage}
% \end{center}
% are of particular interest since they allow to avoid the costly automata-product construction
% of model-checkers and to separate infinite sub-systems from finite (model-checkable) ones; however,
% their side-conditions \<open>C\<close> are particularly tricky to work out. Decomposition rules may pave the
% way for future tool combinations for model-checkers such as FDR4~@{cite "fdr4"} or
% PAT~@{cite "SunLDP09"} based on proof certifications.*)

section*["pre"::tc,main_author="Some(@{docitem \<open>bu\<close>}::author)"]
\<open>Preliminaries\<close>
text\<open>\<close>

subsection*[cspsemantics::tc, main_author="Some(@{docitem ''bu''})"]\<open>Denotational \<^csp> Semantics\<close>

text\<open> The denotational semantics (following @{cite "roscoe:csp:1998"}) comes in three layers:
the \<^emph>\<open>trace model\<close>, the \<^emph>\<open>(stable) failures model\<close> and the \<^emph>\<open>failure/divergence model\<close>.

In the trace semantics model, a process \<open>P\<close> is denoted by a set of communication traces,
built from atomic events. A trace here represents a partial history of the communication
sequence occurring when a process interacts with its environment. For the two basic \<^csp>
processes \<open>Skip\<close> (successful termination) and \<open>Stop\<close> (just deadlock), the semantic function
\<open>\<T>\<close> of the trace model just gives the same denotation, \<^ie> the empty trace:
\<open>\<T>(Skip) = \<T>(Stop) = {[]}\<close>.
Note that the trace sets, representing all \<^emph>\<open>partial\<close> histories, are in general prefix-closed.\<close>

text*[ex1::math_example, status=semiformal] \<open>
Let two processes be defined as follows:

  \<^enum> \<open>P\<^sub>d\<^sub>e\<^sub>t = (a \<rightarrow> Stop) \<box> (b \<rightarrow> Stop)\<close>
  \<^enum> \<open>P\<^sub>n\<^sub>d\<^sub>e\<^sub>t = (a \<rightarrow> Stop) \<sqinter> (b \<rightarrow> Stop)\<close>
\<close>

text\<open>These two processes \<open>P\<^sub>d\<^sub>e\<^sub>t\<close> and \<open>P\<^sub>n\<^sub>d\<^sub>e\<^sub>t\<close> cannot be distinguished by using
the trace semantics: \<open>\<T>(P\<^sub>d\<^sub>e\<^sub>t) = \<T>(P\<^sub>n\<^sub>d\<^sub>e\<^sub>t) = {[],[a],[b]}\<close>. To resolve this problem, Brookes
@{cite "BrookesHR84"} proposed the failures model, where communication traces are augmented with
constraint information on further communication, represented negatively as a refusal set.
A failure \<open>(t, X)\<close> is a pair of a trace \<open>t\<close> and a set of events \<open>X\<close> that a process can refuse if
any of the events in \<open>X\<close> were offered to it by the environment after performing the trace \<open>t\<close>.
The semantic function \<open>\<F>\<close> in the failures model maps a process to a set of failures.
Let \<open>\<Sigma>\<close> be the set of events. Then, \<open>{([],\<Sigma>)} \<subseteq> \<F> Stop\<close> as the process \<open>Stop\<close> refuses all events.
For Example 1, we have \<open>{([],\<Sigma>\{a,b}),([a],\<Sigma>),([b],\<Sigma>)} \<subseteq> \<F> P\<^sub>d\<^sub>e\<^sub>t\<close>, while
\<open>{([],\<Sigma>\{a}),([],\<Sigma>\{b}),([a],\<Sigma>),([b],\<Sigma>)} \<subseteq> \<F> P\<^sub>n\<^sub>d\<^sub>e\<^sub>t\<close> (the \<open>_\<subseteq>_\<close> refers to the fact that
the refusals must be downward closed; we show only the maximal refusal sets here).
Thus, internal and external choice, also called \<^emph>\<open>nondeterministic\<close> and \<^emph>\<open>deterministic\<close>
choice, can be distinguished in the failures semantics.

However, it turns out that the failures model suffers from another deficiency with respect to
the phenomenon called infinite internal chatter or \<^emph>\<open>divergence\<close>.\<close>

text*[ex2::example, status=semiformal] \<open>
The following process \<open>P\<^sub>i\<^sub>n\<^sub>f\<close> is an infinite process that performs \<open>a\<close> infinitely
many times.
However, using the \<^csp> hiding operator \<open>_\_\<close>, this activity is concealed:

  \<^enum> \<open>P\<^sub>i\<^sub>n\<^sub>f = (\<mu> X. a \<rightarrow> X) \ {a}\<close>
\<close>

text\<open>where \<open>P\<^sub>i\<^sub>n\<^sub>f\<close> will be equivalent to \<open>\<bottom>\<close> in the process cpo ordering.
To distinguish divergences from the deadlock process, Brookes and Roscoe proposed the
failure/divergence model, which incorporates divergence traces @{cite "brookes-roscoe85"}.
A divergence trace is one that leads to possibly divergent behavior.
A well-behaved process should be able to respond to its environment in a finite amount of time.
Hence, divergences are considered as a kind of catastrophe in this model.
Thus, a process is represented by a failure set \<open>\<F>\<close>, together with a set of divergence traces
\<open>\<D>\<close>; in our example, the empty trace \<open>[]\<close> belongs to \<open>\<D> P\<^sub>i\<^sub>n\<^sub>f\<close>.

The failure/divergence model has become the standard semantics for an enormous range of \<^csp>
research and the implementations of @{cite "fdr4" and "SunLDP09"}. Note that the work
of @{cite "IsobeRoggenbach2010"} is restricted to a variant of the failures model only.
\<close>

subsection*["isabelleHol"::tc, main_author="Some(@{docitem ''bu''})"]\<open>Isabelle/HOL\<close>
text\<open> Nowadays, Isabelle/HOL is one of the major interactive theory development environments
@{cite "nipkow.ea:isabelle:2002"}. HOL stands for Higher-Order Logic, a logic based on simply-typed
\<open>\<lambda>\<close>-calculus extended by parametric polymorphism and Haskell-like type-classes.
Besides interactive and integrated automated proof procedures, it offers code and documentation
generators. Its structured proof language Isar is used intensively in a large body of existing
work and has been a key factor for the success of the Archive of Formal Proofs
(\<^url>\<open>https://www.isa-afp.org\<close>).

For the work presented here, one relevant construction is:

  \<^item> \<^theory_text>\<open>typedef (\<alpha>\<^sub>1,...,\<alpha>\<^sub>n)t = E\<close>

It creates a fresh type that is isomorphic to a set \<open>E\<close> involving \<open>\<alpha>\<^sub>1,...,\<alpha>\<^sub>n\<close> types.
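
As a small illustration of this mechanism (a sketch only: the type name \<open>three\<close> is a
hypothetical example and not part of the development described here), a three-element type can be
carved out of \<open>nat\<close>. The accompanying proof discharges the obligation that the representing set
is non-empty, here by naming the witness \<open>0\<close>:
@{theory_text [display] \<open>typedef three = "{0::nat, 1, 2}"
  by (rule exI[of _ 0]) simp\<close>}
Such a declaration also introduces the morphisms \<open>Rep_three\<close> and \<open>Abs_three\<close> between the new type
and its representing set.
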
Isabelle/HOL performs a number of syntactic checks for these constructions that guarantee the
logical consistency of the defined constants or types relative to the axiomatic basis of HOL.
The system distribution comes with rich libraries comprising Sets, Numbers, Lists, etc. which
are built in this "conservative" way.

For this work, a particular library called \<^theory_text>\<open>HOLCF\<close> is intensively used. It provides
classical domain theory for a particular type-class \<open>\<alpha>::pcpo\<close>, \<^ie> the class of types \<open>\<alpha>\<close> for
which

  \<^enum> a least element \<open>\<bottom>\<close> is defined, and
  \<^enum> a complete partial order \<open>_\<sqsubseteq>_\<close> is defined.

For these types, \<^theory_text>\<open>HOLCF\<close> provides a fixed-point operator \<open>\<mu>X. f X\<close> as well as the
fixed-point induction and other (automated) proof infrastructure. Isabelle's type-inference can
automatically infer, for example, that if \<open>\<alpha>::pcpo\<close>, then \<open>(\<beta> \<Rightarrow> \<alpha>)::pcpo\<close>. \<close>

section*["csphol"::tc,main_author="Some(@{docitem ''bu''}::author)", level="Some 2"]
\<open>Formalising Denotational \<^csp> Semantics in HOL \<close>
text\<open>\<close>

subsection*["processinv"::tc, main_author="Some(@{docitem ''bu''})"]
\<open>Process Invariant and Process Type\<close>
text\<open> First, we need a slight revision of the concept
of \<^emph>\<open>trace\<close>: if \<open>\<Sigma>\<close> is the type of the atomic events (represented by a type variable), then
we need to extend this type by a special event \<open>\<checkmark>\<close> (called "tick") signaling termination.
Thus, traces have the type \<open>(\<Sigma>+\<checkmark>)\<^sup>*\<close>, written \<open>\<Sigma>\<^sup>\<checkmark>\<^sup>*\<close>; since \<open>\<checkmark>\<close> may only occur
at the end of a trace, we need to define a predicate \<open>front\<^sub>-tickFree t\<close> that requires from
traces that \<open>\<checkmark>\<close> can only occur at the end.

Second, in the traditional literature, the semantic domain is implicitly described by 9 "axioms"
over the three semantic functions \<open>\<T>\<close>, \<open>\<F>\<close> and \<open>\<D>\<close>.
Informally, these are:

  \<^item> the initial trace of a process must be empty;
  \<^item> any allowed trace must be \<open>front\<^sub>-tickFree\<close>;
  \<^item> traces of a process are \<^emph>\<open>prefix-closed\<close>;
  \<^item> a process can refuse all subsets of a refusal set;
  \<^item> any event refused by a process after a trace \<open>s\<close> must be in a refusal set associated to \<open>s\<close>;
  \<^item> the tick accepted after a trace \<open>s\<close> implies that all other events are refused;
  \<^item> a divergence trace with any suffix is itself a divergence one;
  \<^item> once a process has diverged, it can engage in or refuse any sequence of events;
  \<^item> a trace ending with \