Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aborted (core dumped) in flow.nn.MaxUnpool1d/MaxUnpool2d/MaxUnpool3d #10521

Open
x0w3n opened this issue May 12, 2024 · 0 comments
Open

Aborted (core dumped) in flow.nn.MaxUnpool1d/MaxUnpool2d/MaxUnpool3d #10521

x0w3n opened this issue May 12, 2024 · 0 comments
Labels
bug community events from community

Comments

@x0w3n
Copy link

x0w3n commented May 12, 2024

Summary

When input some boundary values to the parameters of oneflow.nn.MaxUnpool1d/MaxUnpool2d/MaxUnpool3d, it triggers the crash.

Code to reproduce bug

MaxUnpool1d:

import oneflow as flow
unpool = flow.nn.MaxUnpool1d(2, stride=2,padding=-8353862602220610428)
input = flow.tensor([[[1., 2, 3, 4, 5, 6, 7, 8, 9]]])
unpool(flow.ones(1),flow.ones(1), output_size=input.size())

ouput:

F20240511 21:50:08.571604 221637 user_op_tensor.h:66] tensor data_type mismatched. value: kFloat, template T:kInt64
*** Check failure stack trace: ***
    @     0x7fffeb829a63  google::LogMessage::Fail()
    @     0x7fffeb82c184  google::LogMessage::SendToLog()
    @     0x7fffeb82959e  google::LogMessage::Flush()
    @     0x7fffeb82c76f  google::LogMessageFatal::~LogMessageFatal()
    @     0x7ffff161a657  oneflow::user_op::Tensor::CheckDataType<>()
    @     0x7ffff1d5323a  oneflow::MaxUnpoolNdKernel<>::Compute()
    @     0x7ffff22cf58a  oneflow::one::StatefulOpKernel::Compute()
    @     0x7ffff1515d91  oneflow::vm::OpCallInstructionUtil::Compute()
    @     0x7ffff15104b0  _ZZN7oneflow2vm23OpCallInstructionPolicy7ComputeEPNS0_11InstructionEENKUlPKcE_clES5_.constprop.0
    @     0x7ffff1510d2b  oneflow::vm::OpCallInstructionPolicy::Compute()
    @     0x7ffff150abe7  oneflow::vm::Instruction::Compute()
    @     0x7ffff1506819  oneflow::vm::EpStreamPolicyBase::Run()
    @     0x7ffff156f369  oneflow::vm::ThreadCtx::TryReceiveAndRun()
    @     0x7ffff1573848  oneflow::(anonymous namespace)::WorkerLoop()
    @     0x7ffff157424a  _ZNSt6thread11_State_implINS_8_InvokerISt5tupleIJPFvPN7oneflow2vm9ThreadCtxERKSt8functionIFvS6_EEES6_ZNS3_14VirtualMachine15CreateThreadCtxENS3_6SymbolINS3_6DeviceEEENS3_10StreamTypeEmEUlS6_E3_EEEEE6_M_runEv
    @     0x7fffeb6c2253  (unknown)
    @     0x7ffff7d27ac3  (unknown)
    @     0x7ffff7db9850  (unknown)
Stack trace (most recent call last) in thread 221637:
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1574249, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1573847, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff156f368, in vm::ThreadCtx::TryReceiveAndRun()
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1506818, in vm::EpStreamPolicyBase::Run(vm::Instruction*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff150abe6, in vm::Instruction::Compute()
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1510d2a, in vm::OpCallInstructionPolicy::Compute(vm::Instruction*)
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff15104af, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1515d90, in vm::OpCallInstructionUtil::Compute(vm::OpCallInstructionPolicy*, vm::Stream*, bool, bool)
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff22cf589, in StatefulOpKernel::Compute(eager::CallContext*, ep::Stream*, user_op::OpKernel const*, user_op::OpKernelState*, user_op::OpKernelCache const*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1d53239, in MaxUnpoolNdKernel<(DeviceType)1, float>::Compute(user_op::KernelComputeContext*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff161a656, in void user_op::Tensor::CheckDataType<long>() const

Aborted (Signal sent by tkill() 221396 0)
Aborted (core dumped)

MaxUnpool2d:

import oneflow as flow
unpool = flow.nn.MaxUnpool2d(2, stride=2,padding=-8353862602220610428)
input = flow.tensor([[[1., 2, 3, 4, 5, 6, 7, 8, 9]]])
unpool(flow.ones(1),flow.ones(1), output_size=input.size())

ouput:

F20240511 21:51:04.809286 222046 user_op_tensor.h:66] tensor data_type mismatched. value: kFloat, template T:kInt64
*** Check failure stack trace: ***
    @     0x7fffeb829a63  google::LogMessage::Fail()
    @     0x7fffeb82c184  google::LogMessage::SendToLog()
    @     0x7fffeb82959e  google::LogMessage::Flush()
    @     0x7fffeb82c76f  google::LogMessageFatal::~LogMessageFatal()
    @     0x7ffff161a657  oneflow::user_op::Tensor::CheckDataType<>()
    @     0x7ffff1d5323a  oneflow::MaxUnpoolNdKernel<>::Compute()
    @     0x7ffff22cf58a  oneflow::one::StatefulOpKernel::Compute()
    @     0x7ffff1515d91  oneflow::vm::OpCallInstructionUtil::Compute()
    @     0x7ffff15104b0  _ZZN7oneflow2vm23OpCallInstructionPolicy7ComputeEPNS0_11InstructionEENKUlPKcE_clES5_.constprop.0
    @     0x7ffff1510d2b  oneflow::vm::OpCallInstructionPolicy::Compute()
    @     0x7ffff150abe7  oneflow::vm::Instruction::Compute()
    @     0x7ffff1506819  oneflow::vm::EpStreamPolicyBase::Run()
    @     0x7ffff156f369  oneflow::vm::ThreadCtx::TryReceiveAndRun()
    @     0x7ffff1573848  oneflow::(anonymous namespace)::WorkerLoop()
    @     0x7ffff157424a  _ZNSt6thread11_State_implINS_8_InvokerISt5tupleIJPFvPN7oneflow2vm9ThreadCtxERKSt8functionIFvS6_EEES6_ZNS3_14VirtualMachine15CreateThreadCtxENS3_6SymbolINS3_6DeviceEEENS3_10StreamTypeEmEUlS6_E3_EEEEE6_M_runEv
    @     0x7fffeb6c2253  (unknown)
    @     0x7ffff7d27ac3  (unknown)
    @     0x7ffff7db9850  (unknown)
Stack trace (most recent call last) in thread 222046:
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1574249, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1573847, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff156f368, in vm::ThreadCtx::TryReceiveAndRun()
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1506818, in vm::EpStreamPolicyBase::Run(vm::Instruction*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff150abe6, in vm::Instruction::Compute()
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1510d2a, in vm::OpCallInstructionPolicy::Compute(vm::Instruction*)
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff15104af, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1515d90, in vm::OpCallInstructionUtil::Compute(vm::OpCallInstructionPolicy*, vm::Stream*, bool, bool)
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff22cf589, in StatefulOpKernel::Compute(eager::CallContext*, ep::Stream*, user_op::OpKernel const*, user_op::OpKernelState*, user_op::OpKernelCache const*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1d53239, in MaxUnpoolNdKernel<(DeviceType)1, float>::Compute(user_op::KernelComputeContext*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff161a656, in void user_op::Tensor::CheckDataType<long>() const

Aborted (Signal sent by tkill() 221808 0)
Aborted (core dumped)

MaxUnpool3d:

import oneflow as flow
unpool = flow.nn.MaxUnpool3d(2, stride=2,padding=-8353862602220610428)
input = flow.tensor([[[1., 2, 3, 4, 5, 6, 7, 8, 9]]])
unpool(flow.ones(1),flow.ones(1), output_size=input.size())

ouput:

F20240511 21:52:05.484270 222469 user_op_tensor.h:66] tensor data_type mismatched. value: kFloat, template T:kInt64
*** Check failure stack trace: ***
    @     0x7fffeb829a63  google::LogMessage::Fail()
    @     0x7fffeb82c184  google::LogMessage::SendToLog()
    @     0x7fffeb82959e  google::LogMessage::Flush()
    @     0x7fffeb82c76f  google::LogMessageFatal::~LogMessageFatal()
    @     0x7ffff161a657  oneflow::user_op::Tensor::CheckDataType<>()
    @     0x7ffff1d5323a  oneflow::MaxUnpoolNdKernel<>::Compute()
    @     0x7ffff22cf58a  oneflow::one::StatefulOpKernel::Compute()
    @     0x7ffff1515d91  oneflow::vm::OpCallInstructionUtil::Compute()
    @     0x7ffff15104b0  _ZZN7oneflow2vm23OpCallInstructionPolicy7ComputeEPNS0_11InstructionEENKUlPKcE_clES5_.constprop.0
    @     0x7ffff1510d2b  oneflow::vm::OpCallInstructionPolicy::Compute()
    @     0x7ffff150abe7  oneflow::vm::Instruction::Compute()
    @     0x7ffff1506819  oneflow::vm::EpStreamPolicyBase::Run()
    @     0x7ffff156f369  oneflow::vm::ThreadCtx::TryReceiveAndRun()
    @     0x7ffff1573848  oneflow::(anonymous namespace)::WorkerLoop()
    @     0x7ffff157424a  _ZNSt6thread11_State_implINS_8_InvokerISt5tupleIJPFvPN7oneflow2vm9ThreadCtxERKSt8functionIFvS6_EEES6_ZNS3_14VirtualMachine15CreateThreadCtxENS3_6SymbolINS3_6DeviceEEENS3_10StreamTypeEmEUlS6_E3_EEEEE6_M_runEv
    @     0x7fffeb6c2253  (unknown)
    @     0x7ffff7d27ac3  (unknown)
    @     0x7ffff7db9850  (unknown)
Stack trace (most recent call last) in thread 222469:
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1574249, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1573847, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff156f368, in vm::ThreadCtx::TryReceiveAndRun()
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1506818, in vm::EpStreamPolicyBase::Run(vm::Instruction*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff150abe6, in vm::Instruction::Compute()
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1510d2a, in vm::OpCallInstructionPolicy::Compute(vm::Instruction*)
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff15104af, in 
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1515d90, in vm::OpCallInstructionUtil::Compute(vm::OpCallInstructionPolicy*, vm::Stream*, bool, bool)
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff22cf589, in StatefulOpKernel::Compute(eager::CallContext*, ep::Stream*, user_op::OpKernel const*, user_op::OpKernelState*, user_op::OpKernelCache const*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff1d53239, in MaxUnpoolNdKernel<(DeviceType)1, float>::Compute(user_op::KernelComputeContext*) const
   Object "/home/temp/oneflow-1.0.0/build/liboneflow.so", at 0x7ffff161a656, in void user_op::Tensor::CheckDataType<long>() const

Aborted (Signal sent by tkill() 222228 0)
Aborted (core dumped)

System Information

  • What is your OneFlow installation (pip, source, dockerhub): pip
  • OS: Ubuntu 22.04.3 LTS
  • OneFlow version (run python3 -m oneflow --doctor):
path: ['*/temp/oneflow-1.0.0/python/oneflow']
version: 0.9.1+cpu.git.ec05076
git_commit: ec05076
cmake_build_type: Release
rdma: False
mlir: False
  • Python version: 3.10.0
  • CUDA driver version: 12.2
  • GPU models: NVIDIA GeForce RTX 3090
  • Other info: None
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug community events from community
Projects
None yet
Development

No branches or pull requests

1 participant