add file content #577

BrentBlanckaert · 2025-01-11T19:19:17Z

No description provided.

…make stdin more consistent

… consitent

BrentBlanckaert · 2025-02-18T18:45:26Z

Documentation for usage of files

This documentation will discuss all the changes made regarding
the IO of a test.

Files

Files are used to describe files that can be added as input for a test and will provide the student with this file in Dodona. This is done in the following way:

files:
  - name: "animal.txt"
    url: "media/workdir/animal.txt"

files is not to be confused with file which will specify the contents and location of a file the code of a student should generate. This can be done in the following way:

file:
  content: "animal.txt" # is the content that the file should have
  location: "media/workdir/animal.txt" # is the location of the file
  oracle: ...

There are several issues with this:

The usage of the names files and file is very confusing
In content you had to use a file and can't just specify the content
You're able to have multiple input files but can only have one output file.
Need more consistency in the naming and formatting

The name files was changed to input_files and file was changed to output_files. The name url in files and location in file were also changed to path. You can now also specify multiple files for the output files. An example is the following:

input_files:
  - name: "animal.txt"
    path: "media/workdir/animal.txt"
  - name: "human.txt"
    path: "media/workdir/human.txt"
output_files:
  data: 
    - content: "lion" 
      path: "media/workdir/animal.txt" 
    - content: "tim"
      path: "media/workdir/human.txt"
  oracle: ....
output_files:
  - content: "animal" 
    path: "media/workdir/animal.txt" 
  - content: "humant"
    path: "media/workdir/human.txt"

You can still only specify paths in the content section of output files.
We can distinguish between actual content and the path the a file that contains it by using !path.
An example is the following:

output_files:
  data: 
    - content: !path "animal.txt" 
      path: "media/workdir/animal.txt" 
    - content: "Humans can make music and a warm meal"
      path: "media/workdir/human.txt"
  oracle: ....

So content will now expect the actual content by default and not a path to load it from.

For the feedback, it's all still a bit fuzzy because right now all the content is dumped after eachother. Potential solution:

Usage of tabs in the solutions
Only show the names and show content when clicking on them.

Most of that will probably need to happen on Dodona itself.

Stdin, Stdout and Stderr

How things currently work, you have to specify the full contents of the stdin, stdout and stderr channels. This can get ugly, when that's a lot of text. This is why the usage of files is also very benificial here.

Example for Stdin:

stdin: !path "media/workdir/animal.txt"

The usage of !path is also present here. This is consistent with what was discussed above.

Under the hood, Stderr and Stdout are both just textual output channels. So they both have the exact same functionality.
If they are a dictionary, they used to expect the key data, but now you can also use content which is more consistent with the rest. Just like before you can also use !path to specify that you want to use a file instead of directly specifying the content.

Examples for Stdout:

stdout: !path "media/workdir/animal.txt"

stdout: 
  content: !path "media/workdir/animal.txt"
  config: ...
  oracle: ...

jorg-vr · 2025-02-26T09:32:43Z

First review based on the text.

I have a feeling the naming scheme is still a bit inconsistent.

The input_files seems quite logical:

input_files: List of files
  - name:  The name of the file to be used in the code (eg to pass to a function)
    path: The path to the content of the file in the exercise directory

But output_files have me confused.
You specify path and content for each file, but it is completely unclear to me how you can have both at the same time.

What I would have expected based on seeing input_files:

output_files: List of files
  - name: The name of the file the student should write it's output in
    path: The path of the expected output file
    content: Alternative to path, specify the expected content of the file inline here

It would even be great if this content variable was also available for input_files.

Instead path and content are specified at the same time, no name variable is present. And content seems to contain either a filename, the actual content of the file or an explicit path using the !path tag.

I like the !path tag option for content, and it works well and consistent for stdin and stdout, but I really don't see how we can the specify path at the same time. It confuses me and will thus probably also confuse our users.

I left out the oracle case in the above example, as that looks fine.

jorg-vr

I didn't specify it every time it is relevant, but how to properly handle the name changes seems like the biggest open issue. It will probably be best if we also discus this IRL tomorrow

jorg-vr · 2025-02-26T09:42:31Z

tested/dsl/schema-strict.json

@@ -39,7 +39,7 @@
        }
      ],
      "properties" : {
-        "files" : {


Sadly, we'll preferably remain backwards compatible here.
So while we can introduce a new improved naming scheme with input_files and output_files, we'll have to keep supporting the old scheme at the same time.

We could also think about starting to give deprecation notices, and/or automatically updating all existing exercises. (We have done these scripted updates before, but these have to be well timed, and with a growing number of users, we have a growing risk of confusing some, if we don't give proper deprecation warnings)

tested/dsl/translate_parser.py

tested/oracles/text.py

tested/dsl/translate_parser.py

niknetniko

The code itself looks OK, added some comments here and there.

More high-level:

Jorg is right that we need backwards compatibility
Your examples are slightly confusing, since the paths in the output_files will not be to the media dir, but rather the name of a file that is either specified in the assignment or as a parameter. So rather something like

output_files:
  data: 
    - content: "lion" 
      path: "animal.txt" 
    - content: "tim"
      path: "human.txt"

And when using a path as "content", it will probably be to a file inside in the evaluation folder of the exercise.

tested/testsuite.py

niknetniko · 2025-03-01T14:10:49Z

tested/dsl/translate_parser.py

@@ -148,6 +168,7 @@ def _parse_yaml(yaml_stream: str) -> YamlObject:
            yaml.add_constructor("!" + actual_type, _custom_type_constructors, loader)
    yaml.add_constructor("!expression", _expression_string, loader)
    yaml.add_constructor("!oracle", _return_oracle, loader)
+    yaml.add_constructor("!path", _path_string, loader)


I am not against adding another custom tag, but I do think we need to be slightly careful here, since we also expose all "TESTed types" as tags, this would prevent us from ever having a "path" type in TESTed.

An alternative is working with plain objects, e.g.

- path: "animal.txt" content: type: "path" path: "media/workdir/animal.txt"

Again, not against it, just something to consider.

tested/testsuite.py

niknetniko · 2025-03-01T14:20:27Z

tested/testsuite.py

@@ -528,7 +535,7 @@ def get_functions(self) -> Iterable[FunctionCall]:

 @define(frozen=True)
 class FileUrl:
-    url: str
+    path: str


The reason this was named url is that this is a path to the media folder of an exercise, since it is linked on Dodona. The "generated "file" stuff should all have paths to the evaluation folder instead. Now, there is an argument to be made that this distinction is unnecessary, but that is done at the Dodona side (I think, it has been a while, so this should be checked).

But to be clear, I am not against renaming this, just giving some context.

…ists. Also fixed the tests

…generated pairs

tested/oracles/text.py

@@ -3,12 +3,20 @@
 """

 import math
+import os


To fix the problem, we should remove the unused import statement for the os module. This will clean up the code and remove the unnecessary dependency, making the code easier to read and maintain.

Locate the import statement for the os module on line 6.

Remove the line import os from the file tested/oracles/text.py.

tested/oracles/text.py

+from tested.testsuite import (
+    FileOutputChannel,
+    OutputChannel,
+    OutputFileData,
+    TextChannelType,
+    TextOutputChannel,
+)


To fix the problem, we need to remove the unused import statement for FileOutputChannel. This will clean up the code and remove unnecessary dependencies, making the code easier to read and maintain.

Locate the import statement for FileOutputChannel in the file tested/oracles/text.py.

Remove the FileOutputChannel from the import statement on line 13.

Ensure that no other parts of the code rely on this import.

BrentBlanckaert added 2 commits January 11, 2025 19:45

changed some names

99a26c5

fixed some tests

667fd20

BrentBlanckaert mentioned this pull request Dec 20, 2024

Planned extensions for Brent's master's thesis #559

Open

67 tasks

BrentBlanckaert self-assigned this Jan 11, 2025

BrentBlanckaert added the enhancement New feature or request label Jan 11, 2025

BrentBlanckaert added 13 commits January 12, 2025 11:34

fixed a test

4a07970

have something that works for stdin

e834e9f

have something working for stdout/stderr

791836e

fixed all tests

ef29394

fixed some small issues

a00f09e

made something that can be used to generate and evaluate multiple files.

ee2123c

Updated output files to be more consistent and do better check. Also …

1b3b892

…make stdin more consistent

Made inlining for stdout and stderr possible and made everything more…

32bb7d9

… consitent

Fixed some minor issues

2afdd67

Fixed the dsl yaml tests

e01ebdc

Fixed buildin oracle tests

2ce7892

Fixed the io function

3a62f6d

Fixed linting issue

a845fb2

BrentBlanckaert added 8 commits February 18, 2025 20:53

fixed last tests

e939f30

Added an extra test

10bd418

Added some more tests

1d3aebd

lint test

2cb31f5

covered an extra case

2ce6701

added an extra case for invalid input

70e3f21

fixed test

441fc8b

fixed linting

9254c14

BrentBlanckaert marked this pull request as ready for review February 25, 2025 16:20

BrentBlanckaert requested review from niknetniko and jorg-vr February 25, 2025 16:20

jorg-vr requested changes Feb 26, 2025

View reviewed changes

cleaned up code

3003cfc

jorg-vr reviewed Feb 27, 2025

View reviewed changes

tested/dsl/translate_parser.py Outdated Show resolved Hide resolved

niknetniko reviewed Mar 1, 2025

View reviewed changes

BrentBlanckaert added 14 commits March 5, 2025 15:25

fixed a small error

723c1a6

Added a better temporary better seperator

af868cf

Changed FileOutputChannel to work with list of objects instead of 3 l…

6d59d54

…ists. Also fixed the tests

fix tests

e841287

changed the input_files

d3e7d46

Fixed some of the issues

8e92928

Fixed linting

9e3e0a6

Fixed test

7bec950

Added link support for stdin

b6fe8e1

Added a few more checks

621f4a7

Have a current version to work with multiple output files

317505a

cleaned up some code and used urls in stdin too

aa29560

Made attempt for stdout/stderr

2d1e568

changed names again and splitted output files over multiple expected/…

497f452

…generated pairs

github-advanced-security bot found potential problems Mar 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add file content #577

add file content #577

BrentBlanckaert commented Jan 11, 2025

BrentBlanckaert commented Feb 18, 2025 •

edited

Loading

jorg-vr commented Feb 26, 2025

jorg-vr left a comment

jorg-vr Feb 26, 2025

niknetniko left a comment

niknetniko Mar 1, 2025

niknetniko Mar 1, 2025

Provide additional feedback

Please help us improve GitHub Copilot by sharing more details about this comment.

Provide additional feedback

Please help us improve GitHub Copilot by sharing more details about this comment.

add file content #577

Are you sure you want to change the base?

add file content #577

Conversation

BrentBlanckaert commented Jan 11, 2025

BrentBlanckaert commented Feb 18, 2025 • edited Loading

Documentation for usage of files

Files

Stdin, Stdout and Stderr

jorg-vr commented Feb 26, 2025

jorg-vr left a comment

Choose a reason for hiding this comment

jorg-vr Feb 26, 2025

Choose a reason for hiding this comment

niknetniko left a comment

Choose a reason for hiding this comment

niknetniko Mar 1, 2025

Choose a reason for hiding this comment

niknetniko Mar 1, 2025

Choose a reason for hiding this comment

BrentBlanckaert commented Feb 18, 2025 •

edited

Loading