Commit Graph

425 Commits

Author SHA1 Message Date
Nicolas Kruse 61dc29e68b added auto grad feature "grad(..)" 2025-12-03 17:30:38 +01:00
Nicolas Kruse c5048980c2 added optimization for + 0, - 0 and * 1 operations 2025-12-03 17:28:49 +01:00
Nicolas Kruse a30ee12d0f fixed get_all_dag_edges to prevent it from emitting edges multiple times 2025-12-03 17:27:29 +01:00
Nicolas Kruse cc5582ae73 replaced list type by Sequence to improve type hinting 2025-12-02 16:59:14 +01:00
Nicolas Kruse d2df1dd3fb cp.sign and cp.relu added to _math.py 2025-12-02 16:57:22 +01:00
Nicolas Kruse 5bdd77db91 readme updated 2025-12-02 16:51:20 +01:00
Nicolas Kruse 9f77ef5642 optimization for float/int operations added 2025-12-01 16:44:49 +01:00
Nicolas Kruse 8fe51a2e45 benchmark script added 2025-12-01 08:29:39 +01:00
Nicolas Kruse a56320f6e0 test for matrix class added 2025-12-01 00:13:51 +01:00
Nicolas Kruse e14092a2c3 matrix class added 2025-12-01 00:13:38 +01:00
Nicolas Kruse 29f5a26278 helping functions separated for vectors and matrices 2025-12-01 00:13:08 +01:00
Nicolas Kruse bb4472eccb variable type hints updated 2025-12-01 00:12:22 +01:00
Nicolas Kruse a8eeea874b median, mean and argsort added for vector class 2025-11-27 17:21:33 +01:00
Nicolas Kruse 054ce6d507 __iter__ type annotation fixed for vector class 2025-11-27 17:20:46 +01:00
Nicolas Kruse 89e8efb864 read_value function extended to work with vectors 2025-11-27 17:20:07 +01:00
Nicolas Kruse ad78c4089c type annotations for iif function fixed 2025-11-27 17:19:25 +01:00
Nicolas Kruse 99a880861a typing fixed, variable[bool] replaced by variable[int] 2025-11-27 12:50:53 +01:00
Nicolas Kruse 44b215f728 check_for_qemu function in tests updated to prevent error in error if qemu is not available 2025-11-27 10:10:13 +01:00
Nicolas Kruse d58a723448 Merge branch 'main' into dev 2025-11-24 19:36:12 +01:00
Nicolas afa8985a7f cd: wheel build script updated for armv7 2025-11-24 16:40:37 +01:00
Nicolas 4a8f758a45 tests updated and extended for armv7 2025-11-24 16:40:37 +01:00
Nicolas 77924a5655 Aux functions are now copied with their whole section to allow for branches without relocation inside of sections. 2025-11-24 16:40:37 +01:00
Nicolas 78a159b4ea linking for required libgcc function on 32 bit arm added 2025-11-24 16:40:37 +01:00
Nicolas Kruse 1a062760f7 detect_process_arch() updated to work for armv7 2025-11-24 16:40:37 +01:00
Nicolas Kruse 4504abf5cb Update musl repository URL in build scripts with a mirror 2025-11-24 16:40:37 +01:00
Nicolas 0d119f5a0c compiler version removed for arm 2025-11-24 16:40:37 +01:00
Nicolas 2191889082 test for armv7 added 2025-11-24 16:40:37 +01:00
Nicolas 8db2c41e1b ci and build scripts updated for armv7 2025-11-24 16:40:37 +01:00
Nicolas 4c4e83a18e R_ARM_JUMP24 & R_ARM_CALL patching added 2025-11-24 16:40:37 +01:00
Nicolas 45808d8373 Switched to pelfy 1.0.7 to support armv7 relocations 2025-11-24 16:40:37 +01:00
Nicolas d5786d2957 addend retrieval removed from patch_arm32_abs (moved to pelfy) 2025-11-24 16:40:37 +01:00
Nicolas fcd34e6937 stencil alignment for ARM set to 4 - gcc complaints otherwise 2025-11-24 16:40:37 +01:00
Nicolas 1e654c65f8 test added for 32 bit arm 2025-11-24 16:40:37 +01:00
Nicolas a93cdd4b81 ci and build scripts updated to build arm v6 and v7 2025-11-24 16:40:37 +01:00
Nicolas 3b1c8f0e72 detection for different arm architectures added 2025-11-24 16:40:37 +01:00
Nicolas 96a1a153d4 PATCH_OBJECT_ARM32_ABS patching added to support 32 bit ARM 2025-11-24 16:40:37 +01:00
Nicolas Kruse dbc6e6194c Merge pull request #10 from Nonannet/dev: New vector function added 2025-11-22 12:26:57 +01:00
Nicolas Kruse bf089d7f7e test for vector functions extended 2025-11-19 10:01:51 +01:00
Nicolas Kruse 4049928139 vector functions extended 2025-11-19 10:01:30 +01:00
Nicolas Kruse b279da800a Merge pull request #9 from Nonannet/dev: Element wise math operations for vectors 2025-11-17 21:21:21 +01:00
Nicolas Kruse beddf2e7e9 test math cleaned up 2025-11-17 08:33:49 +01:00
Nicolas Kruse ebb3b89e69 math functions extended for vector. switched acos function. 2025-11-17 08:33:29 +01:00
Nicolas Kruse 2e3ececed2 Merge pull request #8 from Nonannet/dev: Compilation speed drastically improved 2025-11-14 23:23:21 +01:00
Nicolas Kruse ff5987f5d1 test for timing compiler steps added 2025-11-14 22:50:23 +01:00
Nicolas Kruse 700d46e404 Compiler performance improved, caching added for get_stencil_code function 2025-11-14 22:47:59 +01:00
Nicolas Kruse 68186fbf69 ci: x86 removed from wheel build script 2025-11-14 19:41:11 +01:00
Nicolas 20a8dcd1a2 Single argument functions do not need a dummy argument anymore 2025-11-14 17:28:05 +01:00
Nicolas 4b752a6094 Download script for precompiled binaries updated and version changed to 0.0.1 2025-11-14 16:35:01 +01:00
Nicolas 0c2f2812db caching for stencil_database.get_relocations added 2025-11-14 16:23:51 +01:00
Nicolas c25ecac86e ci musl: use -fno-pic flag only for x86 2025-11-14 11:12:36 +01:00