Research: Capturing output from interactive tool e.g. `fzf` #5342

anki-code · 2024-04-17T12:22:39Z

Motivation

fzf is a common tool to fuzzy search and commonly used to choosing file name. People expect that it works in captured object mode !() but this mode was made to capture everything (stdout/stderr) and the people expectations are not satisfied because fzf use stderr to show UI and produces unexpected exceptions. We need to have solid approach of how to operate with this kind of tools and fix exceptions.

UPD: Instant answer

Use $() to operate this this kind of tools. The captured mode operator !() only for commands that have no any interaction with terminal.

Goals

Catch and fix logic errors and exceptions.
Fix what is possible to work as expected.
For the complex cases provide solid way to achieve expected result without errors.

Test cases

Pipe (#2404 #2159 #5003 #4444):

r1 = $(fzf)
r2 = $(ls / | fzf)  # I'm using `ls` for the test because it allows to catch issues around interactive usage because shows the colors.
r3 = $(ls / | fzf | head)
r1, r2, r3

r1 = !(fzf)
r2 = !(ls / | fzf)
r3 = !(ls / | fzf | head)
r1, r2, r3

r1 = ![fzf]
r2 = ![ls / | fzf]
r3 = ![ls / | fzf | head]
r1, r2, r3

r1 = $[fzf]
r2 = $[ls / | fzf]
r3 = $[ls / | fzf | head]
r1, r2, r3

# TODO: 
# * Alias at the beginning, middle, end.
# * Different type of alias (string, function, ExecAlias, threadable, unthreadable, capturable, uncapturable).

Alias:

@aliases.register('astd')
def _astd():
    r1 = $(fzf)
    r2 = $(echo 123 | fzf)
    r3 = $(echo 123 | fzf | head)
    r1, r2, r3

@aliases.register('aobj')
def _aobj():
    r1 = !(fzf)
    r2 = !(echo 123 | fzf)
    r3 = !(echo 123 | fzf | head)
    r1, r2, r3

@aliases.register('ahid')
def _ahid():
    r1 = ![fzf]
    r2 = ![echo 123 | fzf]
    r3 = ![echo 123 | fzf | head]
    r1, r2, r3

@aliases.register('aunc')
def _aunc():
    r1 = $[fzf]
    r2 = $[echo 123 | fzf]
    r3 = $[echo 123 | fzf | head]
    r1, r2, r3

# TODO: @unthreadable

# Alias
aliases['r1'] = "fzf"
aliases['r2'] = "echo 123 | fzf"
aliases['r3'] = "echo 123 | fzf | head"

# ExecAlias
aliases['er1'] = "@('fzf')"
aliases['er2'] = "echo 123 | @('fzf')"
aliases['er3'] = "echo 123 | @('fzf') | head"

PTK (#4700):

from prompt_toolkit.keys import Keys
@events.on_ptk_create
def custom_keybindings(bindings, **kw):
    @bindings.add(Keys.ControlR)
    def fzf_history(event):
        choice = $(@('fzf'))
# Press Ctrl+R

Script mode:

echo 'r = !(fzf)' > script.xsh
xonsh --no-rc script.xsh

TODO: XonshMode (main.py): single_command, script_from_file, script_from_file, interactive.

Exceptions

Test cases could work when we run it in fresh session and once BUT if we randomly run different kind of capturing in one session (i.e. $(),![], etc) we may have unexpected errors:

I/O operation on closed file.
RecursionError: maximum recursion depth exceeded (!(echo 123 | fzf))
stty: 'standard input': unable to perform all requested operations.
Interrupted system call
Bad file descriptor

This issue is to catch and fix them.

Docs

Basics:

Find the code examples:

Subprocess/popen/thread:

Signals in threads:

TTY/stdin/stdout/stderr:

Unthreading:

Tools

xunter - Profiling for the xonsh shell based on python-hunter.

Uncompleted efforts around threads

jnoortheen - rm_threads_subproc - try to remve threads.
scopatz - nothread - try to remove threads prediction.
astraluma - slug - try to recreate command pipelines.

For community

⬇️ Please click the 👍 reaction instead of leaving a +1 or 👍 comment

The text was updated successfully, but these errors were encountered:

anki-code · 2024-04-18T11:51:53Z

Example for playing with Popen and fzf (it works perfect on macOS and also when ran from xonsh):

# This is the same as bash `ls / | fzf 2> /tmp/fzf_stderr`
import subprocess as sp
ls = sp.Popen(('ls','/'), stdout=sp.PIPE)
fzf = sp.Popen(["fzf"], stdin=ls.stdout, stdout=sp.PIPE)  # stderr=sp.PIPE - hide the fzf UI
head = sp.Popen(["head"], stdin=fzf.stdout, stdout=sp.PIPE)
stdout, stderr = head.communicate()

print('stdout:', repr(stdout))
print('stderr:', repr(stderr))

💡 I found that if we set stderr=sp.PIPE in sp.Popen(["fzf"] we will have the case when there is no fzf interface on the screen but if you press Enter or use arrows you will get the right result.

🤔 Hypothesis: when we capture stderr from fzf we have the case with hidden fzf UI.

anki-code · 2024-04-18T14:00:54Z

I got answer from fzf owner:

fzf:
* Input list: Read from STDIN
* User input: Read from /dev/tty
* UI: Print to STDERR
* Selected item: Print to STDOUT

anki-code · 2024-04-19T15:10:09Z

Found repeatable case. ⚠️ Need to fix.

Minimal case:

xonsh --no-rc
!(fzf)
# Ctrl-C - RecursionError: maximum recursion depth exceeded
# Back to the terminal.
$(echo 1)
# Exception

How it works:

During run !(fzf) we create xonsh.procs.posix.PopenThread with fzf that waits user input from /dev/tty.
By pressing Ctrl+C we force thread to closing (see PopenThread._signal_int) but because fzf waiting for tty it stays alive without returning the exit code and waiting/polling processes are endless and after time we see maximum recursion depth exceeded.
After we back to terminal the thread still alive e.g. pstree -s fzf.
When we run $(fzf) we see i/o exception because previous process is on going.

The challenge: we need to carefully terminate Popen(fzf) after catching the Ctrl+C.

We can do proc.returncode=130 (to stop endless poll-ing loop) and proc.kill() in PopenThread._signal_int but let's find more better solution like sending Ctrl+C (sigint) to tty to say fzf stop working normally.

--

Two types of exceptions can be. First:

Exception in thread Thread-11:
object address  : 0x111c12bc0
object refcount : 2
object type     : 0x104aefd00
object type name: ValueError
object repr     : ValueError('I/O operation on closed file.')
lost sys.stderr
Exception in threading.excepthook:
Traceback (most recent call last):
  File "/Users/pc/.local/xonsh-env/lib/python3.12/threading.py", line 1379, in invoke_excepthook
    hook(args)
  File "/Users/pc/.local/xonsh-env/lib/python3.12/site-packages/xonsh/base_shell.py", line 198, in flush
    getattr(getattr(self, "mem", lambda: None), "flush", lambda: None)()
ValueError: I/O operation on closed file.

Second:

Exception in thread Thread-12:
Traceback (most recent call last):
  File "/usr/local/lib/python3.11/threading.py", line 1038, in _bootstrap_inner
    self.run()
  File "/usr/local/lib/python3.11/site-packages/xonsh/procs/posix.py", line 180, in run
    i = self._read_write(procout, stdout, sys.__stdout__)
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/xonsh/procs/posix.py", line 223, in _read_write
    writer.flush()
ValueError: I/O operation on closed file.

--

Related code points:

xonsh/xonsh/procs/pipelines.py

Line 333 in 2419eaa

proc.wait()

xonsh/xonsh/procs/posix.py

Line 162 in 8ff5079

while proc.poll() is None:

anki-code · 2024-04-23T16:43:58Z

New knowledge (for me). Short list:

If we run !(sleep 10) it has STAT=S (sleep) in ps au and can be interrupted by SIGINT any time successfully.
If we run !(fzf) it will immediately have STAT=T (stopped) in ps au. If you try to send continue signal to the process many times (i.e. 5 times run kill -SIGCONT <pid of thread>) you will see output in stderr and have successful ending with stderr "Failed to read /dev/tty".
If we run !(python -c 'input("i")') the command will immediately have STAT=T (stopped) in ps au but to successfully interrupt it you need to send SIGINT signal and then send SIGCONT. It's enough to success.

Сonclusion:

We see the situation that interactive process was stopped after run in thread (background?). This looks like Terminal Access Control Mechanism in action. CommandPipeline and PopenThread don’t know how to handle this situation. I'm going to treat this as a feature to implement. UPD: see #5361

Any help are very welcome! @jnoortheen @gforsyth

jnoortheen · 2024-04-23T16:53:05Z

You are doing a great work. As I tried to remove the threaded subprocs, most of the code has used threads in some way and needed more time and haven't looked beyond the surface.

### Motivation There is annoying behavior when you run command in the loop and can't interrupt e.g. [this report](#5371) and a bit #5342. After diving into this I see the issue around return code. ### The essence Basically ``p = subprocess.Popen()`` populates ``p.returncode`` after ``p.wait()``, ``p.poll()`` or ``p.communicate()`` ([doc](https://docs.python.org/3/library/os.html#os.waitpid)). But if you're using `os.waitpid()` BEFORE these functions you're capturing return code from a signal subsystem and ``p.returncode`` will be ``0`` like success but it's not success. So after ``os.waitid`` call you need to set return code manually ``p.returncode = -os.WTERMSIG(status)`` like in Popen. Example: ```xsh python # python interactive import os, signal, subprocess as sp p = sp.Popen(['sleep', '100']) p.pid # 123 p.wait() # Ctrl+C or `kill -SIGINT 123` from another terminal p.returncode # -2 # BUT: p = sp.Popen(['sleep', '100']) p.pid # 123 pid, status = os.waitpid(p.pid, os.WUNTRACED) # status=2 # From another terminal: kill -SIGINT 123 p.wait() # 0 p.returncode # 0 ``` ```xsh from xonsh.tools import describe_waitpid_status describe_waitpid_status(2) # WIFEXITED - False - Return True if the process returning status exited via the exit() system call. # WEXITSTATUS - 0 - Return the process return code from status. # WIFSIGNALED - True - Return True if the process returning status was terminated by a signal. # WTERMSIG - 2 - Return the signal that terminated the process that provided the status value. # WIFSTOPPED - False - Return True if the process returning status was stopped. # WSTOPSIG - 0 - Return the signal that stopped the process that provided the status value. ``` See also: [Helpful things for knight](#5361 (comment)). ### Before ```xsh $RAISE_SUBPROC_ERROR = True sleep 100 # Ctrl+C _.rtn # 0 # It's wrong and RAISE_SUBPROC_ERROR ignored. for i in range(5): print(i) sleep 5 # 0 # Ctrl+C # Can't interrupt # 1 # 2 ``` ### After ```xsh sleep 100 # Ctrl+C _.rtn # -2 # Like in default Popen $RAISE_SUBPROC_ERROR = True for i in range(5): print(i) sleep 5 # 0 # Ctrl+C # subprocess.CalledProcessError ``` ### Notes * We need to refactor `xonsh.jobs`. It's pretty uncomfortable work with module. * The logic is blurry between Specs, Pipelines and Jobs. We need to bring more clear functions. * The `captured` variable looks like "just the way of capturing (stdout, object)" but in fact it affects all logic very much. We need to create table where we can see the logic difference for every type of capturing. E.g. in `captured=stdout` mode we use `xonsh.jobs` to monitor the command but in `captured=object` we avoid this and some logic from `xonsh.jobs` applied in `stdout` mode but missed in `object` mode. We need clear map. ## For community ⬇️ **Please click the 👍 reaction instead of leaving a `+1` or 👍 comment** --------- Co-authored-by: a <1@1.1> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

anki-code · 2024-05-06T11:12:46Z

Current state. I made a bunch of merged pull requests during the looking around (see above).
Today I'm seeing the essence of this issue as: what we need to do if in full captured mode the process is waiting for input?
And this question has no obvious answer.

UPD: OS suspends these processes and posix shells keep them tracked in jobs. I've implemented the same in 5361

junegunn · 2024-05-14T07:37:00Z

@anki-code This might be of your interest. junegunn/fzf@d274d09

…5422) ### Motivation There is no way to access to the CommandPipeline of executed command in uncaptured mode. This causes: * No chance to get return code in one universal way. * Barrier to trace and inspection. ### Before ```xsh $(ls nofile) # No way to access to the CommandPipeline of executed command. ``` ### After ```xsh $(ls nofile) __xonsh__.last.rtn # In interactive or not interactive. # 2 ``` ```xsh # Sugar sir: last = type('LastCP', (object,), {'__getattr__':lambda self, name: getattr(__xonsh__.last, name) })() ls nofile last.rtn # 2 ``` JFYI #5342 ## For community ⬇️ **Please click the 👍 reaction instead of leaving a `+1` or 👍 comment** --------- Co-authored-by: a <1@1.1> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

anki-code added threading integration-with-other-tools labels Apr 17, 2024

This was referenced Apr 17, 2024

!(fzf) doesn't capture output reliably #5190

Closed

!(fzf | cat) fails to capture output #5189

Closed

$(fzf) unexpected result #4700

Closed

anki-code changed the title ~~fzf: solve cases for tools like fzf~~ fzf: solve cases for tools like fzf (interactive tool with output) Apr 17, 2024

anki-code added the priority-high label Apr 17, 2024

anki-code changed the title ~~fzf: solve cases for tools like fzf (interactive tool with output)~~ fzf: solve cases for tools like fzf (capturing output from interactive tool) Apr 17, 2024

anki-code changed the title ~~fzf: solve cases for tools like fzf (capturing output from interactive tool)~~ Capturing output from interactive tool e.g. fzf Apr 17, 2024

anki-code mentioned this issue Apr 17, 2024

How to capture output of programs that use alternate mode #2404

Closed

anki-code added the help wanted label Apr 17, 2024

anki-code mentioned this issue Apr 18, 2024

Re-rewrite Executor #2159

Open

2 tasks

anki-code mentioned this issue Apr 18, 2024

How fzf operate with stdin/stdout/stderr/tty/pipe? junegunn/fzf#3741

Closed

anki-code removed the help wanted label Apr 19, 2024

anki-code mentioned this issue Apr 19, 2024

Re-thinking the command substitution operators #3924

Closed

This was referenced Apr 24, 2024

Added detailed info about thread to thread exceptions #5357

Merged

Read stop signals from the process and update the process state. #5361

Merged

jobs: catching ChildProcessError #5365

Merged

XONSH_TRACE_SUBPROC returns more useful details #5368

Merged

anki-code mentioned this issue May 1, 2024

Fixed populating the return code for interrupted process. #5380

Merged

anki-code changed the title ~~Capturing output from interactive tool e.g. fzf~~ Research: Capturing output from interactive tool e.g. fzf May 15, 2024

anki-code mentioned this issue May 18, 2024

Now last executed CommandPipeline is available in __xonsh__.last #5422

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Research: Capturing output from interactive tool e.g. `fzf` #5342

Research: Capturing output from interactive tool e.g. `fzf` #5342

anki-code commented Apr 17, 2024 •

edited

anki-code commented Apr 18, 2024 •

edited

anki-code commented Apr 18, 2024

anki-code commented Apr 19, 2024 •

edited

anki-code commented Apr 23, 2024 •

edited

jnoortheen commented Apr 23, 2024

anki-code commented May 6, 2024 •

edited

junegunn commented May 14, 2024

Research: Capturing output from interactive tool e.g. fzf #5342

Research: Capturing output from interactive tool e.g. fzf #5342

Comments

anki-code commented Apr 17, 2024 • edited

Motivation

UPD: Instant answer

Goals

Test cases

Exceptions

Docs

Tools

Uncompleted efforts around threads

For community

anki-code commented Apr 18, 2024 • edited

anki-code commented Apr 18, 2024

anki-code commented Apr 19, 2024 • edited

anki-code commented Apr 23, 2024 • edited

jnoortheen commented Apr 23, 2024

anki-code commented May 6, 2024 • edited

junegunn commented May 14, 2024

Research: Capturing output from interactive tool e.g. `fzf` #5342

Research: Capturing output from interactive tool e.g. `fzf` #5342

anki-code commented Apr 17, 2024 •

edited

anki-code commented Apr 18, 2024 •

edited

anki-code commented Apr 19, 2024 •

edited

anki-code commented Apr 23, 2024 •

edited

anki-code commented May 6, 2024 •

edited