Insights: tensorflow/tensorflow
Overview
257 Pull requests merged by 6 people
-
Fix tfrt gpu client shutdown process
#93058 merged
May 11, 2025 -
[async-pjrt] Add semaphore to limit max inflight computations
#93105 merged
May 11, 2025 -
Fix a bug where `ToLiteral` fails with a token buffer
#93102 merged
May 10, 2025 -
Remove the semaphore in ExecuteHelper
#93100 merged
May 10, 2025 -
[async-pjrt] Make H2D callback nonblocking
#93095 merged
May 10, 2025 -
Emit a metric for unified model id of loaded models
#93099 merged
May 10, 2025 -
Automated Code Change
#93071 merged
May 10, 2025 -
Automated Code Change
#93068 merged
May 10, 2025 -
Fall back to the CPU tensor's shape when the device buffer has a dynamic shape
#93089 merged
May 10, 2025 -
Reverts changelist 714988843
#93038 merged
May 10, 2025 -
Update CustomCallApiVersion for NopReturnToken in hlo_control_flow_flattening.
#93019 merged
May 10, 2025 -
XLA:CPU: If configured, use more memory types and change the
#92982 merged
May 10, 2025 -
Automated Code Change
#93079 merged
May 10, 2025 -
Automated Code Change
#93001 merged
May 10, 2025 -
Automated Code Change
#93072 merged
May 10, 2025 -
Automated Code Change
#93003 merged
May 10, 2025 -
[PJRT:GPU] Remove unnecessary synchronization in donating a buffer.
#93065 merged
May 10, 2025 -
Fix a crash when printing an invalid `Layout`.
#93051 merged
May 10, 2025 -
Enable building `upb` with Clang.
#92976 merged
May 9, 2025 -
[XLA:benchmarks] Add a workflow file to generate GHA input matrices
#93063 merged
May 9, 2025 -
Add CommonPjRtClient::CreateUninitializedBuffer and remove
#92974 merged
May 9, 2025 -
[XLA] Simplify ArrayTypeSwitch to infer the return type from the function given.
#92439 merged
May 9, 2025 -
Reverts 745718a9b6d5c65f8cf387283e4d853dbe095a87
#93054 merged
May 9, 2025 -
Fix race condition in tensorflow/python/data/flat_map_utils.cc.
#92992 merged
May 9, 2025 -
Add an API to compilation environments to initialize all known compilation environments
#92825 merged
May 9, 2025 -
Use low latency thread pool for async PjRT
#92991 merged
May 9, 2025 -
Manually migrate deprecated references to `strings::StrCat` and `strings::StrAppend`.
#93035 merged
May 9, 2025 -
Update some tests.
#92968 merged
May 9, 2025 -
Use the stable libtpu release for TF wheel runtime, instead of a nightly one.
#92960 merged
May 9, 2025 -
Cleaned up coordination service key-value store.
#91424 merged
May 9, 2025 -
Remove unused "using" statement.
#93033 merged
May 9, 2025 -
Simplify `AbslStringify` for `tstring` and `AlphaNumBuffer`.
#93044 merged
May 9, 2025 -
[XLA:GPU] Deprecate PIPELINE_PARALLELISM_OPT_LEVEL_ENABLE_CYCLE_DECOMPOSER
#90010 merged
May 9, 2025 -
Use the device buffer's shape to construct a literal during D2H
#93053 merged
May 9, 2025 -
Fix a type in hlo_computation.h.
#92989 merged
May 9, 2025 -
Reverts 69a3d55402d26a4e03b7b8cb6b45b0a8fe553707
#93050 merged
May 9, 2025 -
[XLA:GPU] (NFC) Refactor CollectivePermute processing in HloVerifier.
#93046 merged
May 9, 2025 -
[XLA:GPU] Add cuBLASLt autotuner backend. The backend returns configs for cuBLASLt custom calls.
#92966 merged
May 9, 2025 -
Deprecate `*_size()` methods to reduce the `Layout` API surface.
#92915 merged
May 9, 2025 -
Add support for literals with custom layouts in `xla::PjRtStreamExecutorBuffer::ToLiteral`
#93029 merged
May 9, 2025 -
Updates ConvertToSolverRequest() to deduplicate node pairs, merging their costs into a single edge.
#92975 merged
May 9, 2025 -
[XLA] Fail the verifier for Call instructions that have output_to_operand_aliasing set.
#92980 merged
May 9, 2025 -
Remove unused `xla::CompiledMemoryStats::serialized_hlo_proto`
#92918 merged
May 9, 2025 -
Add boolean attribute use_shardy_partitioner to XlaCallModule op.
#92910 merged
May 9, 2025 -
[GPU] Replace ConditionalThunkConfig branch_count field with method.
#93024 merged
May 9, 2025 -
Add unit tests for ragged_dot to reshards pass to show it supports.
#92963 merged
May 9, 2025 -
[XLA:GPU] Add RaggedAllToAll to the list of ops with supported low precision inputs.
#93037 merged
May 9, 2025 -
Add basic thunk deserialization logic
#92793 merged
May 9, 2025 -
Fix an error in the doc of `tf.strided_slice`
#92956 merged
May 9, 2025 -
[XLA:GPU] Use cutlass 3.8.0
#92884 merged
May 9, 2025 -
Automated Code Change
#92953 merged
May 9, 2025 -
[mlir][tosa] Update variable read/write names to align with v1.0 spec
#92911 merged
May 9, 2025 -
Define a type alias for `tsl::RCReference<Value>`
#92946 merged
May 9, 2025 -
[GPU] Refactor Thunk::ToProto() to return proto instead of using output param
#92961 merged
May 9, 2025 -
Automated Code Change
#92682 merged
May 9, 2025 -
Initialize distributed devices
#92942 merged
May 9, 2025 -
[XLA:SPMD] Fix Gather/Scatter preference initialization.
#92977 merged
May 9, 2025 -
Support Pinned Memory in H2D transfer.
#92979 merged
May 9, 2025 -
const-correct newly-added RegisterPluggableDevicePlugin API.
#92990 merged
May 9, 2025 -
Rename tf_quant ops similar to original quant ops
#92899 merged
May 9, 2025 -
Integrate LLVM at llvm/llvm-project@2d287f51eff2
#92967 merged
May 8, 2025 -
AllReduce,Case,If,Map,Reduce,Return,While : Direct StableHLO to HLO Translation.
#92590 merged
May 8, 2025 -
Allow multi-thread scheduling for async PJRT
#92983 merged
May 8, 2025 -
Don't print the shared object correspondence map.
#92959 merged
May 8, 2025 -
[TF2/C++] Add `TF_SKIP_LOADING_OPTIMIZED_FUNCTION_DEPENDENCIES` environment variable.
#92965 merged
May 8, 2025 -
Added new `GroupExecute` API to XLA GPU collectives.
#92964 merged
May 8, 2025 -
Branch quantization/stablehlo:test_passes to use TFQuantDialect
#92138 merged
May 8, 2025 -
Load python rules instead of using native rules.
#92746 merged
May 8, 2025 -
Add benchmark for Shape assignment.
#92824 merged
May 8, 2025 -
Remove extraneous onednn patch
#92970 merged
May 8, 2025 -
[XLA:CPU] Allow users to pass device assignment to NanoRt
#92948 merged
May 8, 2025 -
Fix TFLite Compiler warnings in Chromium build
#92908 merged
May 8, 2025 -
[tosa] Fix member access post-MLIR upstream 0c61b24
#92969 merged
May 8, 2025 -
[XLA][runtime] Add strongly typed xla specific GlobalDeviceId and LocalDeviceId.
#92951 merged
May 8, 2025 -
Fix logging format string not referring to all passed variables.
#92919 merged
May 8, 2025 -
Branch quantization/stablehlo/cc:pre_calibration to use TFQuantDialect
#92909 merged
May 8, 2025 -
Use `LoadedExecutableRef` instead of `std::shared_ptr<LoadedExecutable>`
#92944 merged
May 8, 2025 -
Use `ArrayRef` instead of `tsl::RCReference<Array>`
#92943 merged
May 8, 2025 -
Reverts 4e2dfdbb4e82695f868a90e823f9a8027ba48a4d
#92629 merged
May 8, 2025 -
[Cleanup] Cleanup whitespace
#92741 merged
May 8, 2025 -
[XLA:GPU] Remove CollectivePermuteValidIterationAnnotator pass
#89923 merged
May 8, 2025 -
Flatten the result tuples while converting HLO to StableHLO
#92312 merged
May 8, 2025 -
[XLA:GPU][Emitters] Use concatenate emitter for 4-bits datatypes
#91916 merged
May 8, 2025 -
Automated Code Change
#92950 merged
May 8, 2025 -
[XLA:CPU] Set fp-contract to on rather than fast for DUCC
#92887 merged
May 8, 2025 -
#sdy export reduce_scatter collective op in stablehlo export.
#92551 merged
May 8, 2025 -
Use transpose emitter for transposes with 4-bits datatypes
#92777 merged
May 8, 2025 -
PR #25635: Add parsing debug options support
#92940 merged
May 8, 2025 -
PR #26183: [XLA:GPU] Add support for SM103a and SM121a architectures (Blackwell)
#92881 merged
May 8, 2025 -
Add proto serialization for GPU CopyThunk
#92808 merged
May 8, 2025 -
Add To/FromProto() to BufferAllocation::Slice
#92897 merged
May 8, 2025 -
[XLA:GPU] Add triton support test for stochastic convert op
#92939 merged
May 8, 2025 -
Added min/max reductions on XNNPACK delegate level.
#92782 merged
May 8, 2025 -
Automated Code Change
#92931 merged
May 8, 2025 -
Re-enable test that is now passing.
#92888 merged
May 8, 2025 -
Automated Code Change
#92938 merged
May 8, 2025 -
Automated Code Change
#92671 merged
May 8, 2025 -
Fixed ppc64le build issues
#92769 merged
May 8, 2025 -
Remove support for sparse dimension's ordered attribute.
#92914 merged
May 8, 2025 -
Integrate StableHLO at openxla/stablehlo@630c315b
#92907 merged
May 8, 2025 -
Internal dir restructure
#90497 merged
May 8, 2025 -
Check whether createStream was successful
#92729 merged
May 8, 2025 -
Get rid of `HloModuleProto` from `CompiledMemoryStats`
#92748 merged
May 7, 2025 -
Add traceme while borrowing streams
#92916 merged
May 7, 2025 -
Adds checkpoint test for tpu_embedding_v3
#92831 merged
May 7, 2025 -
[IFRT] Add simple string array tests to IFRT common array_impl_test
#92823 merged
May 7, 2025 -
[ReplicaGroupV2] Generate all-to-all collectives using IotaReplicaGroupList format.
#92755 merged
May 7, 2025 -
Use `ArrayRef` instead of `tsl::RCReference<Array>`
#92898 merged
May 7, 2025 -
Clarify the difference between internal and external version.
#92830 merged
May 7, 2025 -
A simple constant folder for case ops
#92812 merged
May 7, 2025 -
Add `strings::LegacyPrecision` to TSL.
#92890 merged
May 7, 2025 -
Force disable build time Python pyc generation.
#92905 merged
May 7, 2025 -
This CL has no functional changes, but improved code structure:
#92091 merged
May 7, 2025 -
Adding an option to initialize Pluggable Device Plugin using function pointers instead of a DSO
#92816 merged
May 7, 2025 -
Fix missing cuda graph launch call back event in new version of CUDA
#92904 merged
May 7, 2025 -
Refactor BFS function to accept distance parameter
#92444 merged
May 7, 2025 -
Branch quantization/stablehlo/cc:post_calibration to use TFQuantDialect
#92889 merged
May 7, 2025 -
Introduce new `CompilePhaseHloRunnerPjRt` and `ExecutePhaseHloRunnerPjRt`.
#92440 merged
May 7, 2025 -
Branch quantization/tensorflow:quantize_preprocess to use TFQuantDialect
#92886 merged
May 7, 2025 -
LogScheduleStatistics is very slow. Lower its level from 1 to 3.
#89198 merged
May 7, 2025 -
[XLA:benchmarks] Add `--workflow_type` to the binary to generate GHA input matrices by workflow types
#92903 merged
May 7, 2025 -
Fix HloRunnerPjRt DeserializeExecutable.
#92902 merged
May 7, 2025 -
Migration fuse_convolution_pass to tf_attrs_and_constraints
#92558 merged
May 7, 2025 -
Refactor the DiffSummary log print.
#92628 merged
May 7, 2025 -
Branch quantization/stablehlo/cc:tf_pass_pipeline to use TFQuantDialect
#92432 merged
May 7, 2025 -
Add an API to the URL generator for generating URLs without selecting nodes.
#92725 merged
May 7, 2025 -
[IFRT] Treat Maximal `HloSharding` on a single device as fully replicated
#92810 merged
May 7, 2025 -
Clean up `LoadedExecutable::Delete` and `LoadedExecutable::IsDeleted`
#92791 merged
May 7, 2025 -
Remove support for sparse dimension's unique attribute.
#92753 merged
May 7, 2025 -
Support memory space in underlying buffer allocation.
#92821 merged
May 7, 2025 -
Support types that have `AbslStringify`, but not `operator<<` in the additional output of `TF_RET_CHECK`.
#92720 merged
May 7, 2025 -
[XLA:benchmarks] Add a binary to generate GHA input matrices to trigger benchmark runs
#92807 merged
May 7, 2025 -
Remove `Shape::ShortDebugString()` to reduce the `Shape` API surface.
#92565 merged
May 7, 2025 -
Only delegate 4 bit weights which are per channel quantized
#92891 merged
May 7, 2025 -
[StableHLO] Fix typo.
#92789 merged
May 7, 2025 -
Reverts eb4629ef35970666a34683d13d7fc8c5b421118a
#92883 merged
May 7, 2025 -
#sdy Apply `xla.sdy.FuncResultSharding` if it has no uses or it's used by an op that isn't `func.return`.
#92882 merged
May 7, 2025 -
[XLA:GPU] Simplify `gpu_hlo_schedule_test` using `absl::ScopedMockLog`.
#92787 merged
May 7, 2025 -
Remove obsolete TODO
#92799 merged
May 7, 2025 -
[XLA:GPU] Favor pipelined over synchronous collectives when combining.
#92795 merged
May 7, 2025 -
[XLA:GPU] (NFC) Refactor ShapeToMlirTypes to use `ForEachLeafShape`.
#92784 merged
May 7, 2025 -
[xla:cpu] Remove Tfrt from PJRT CPU buffer type names
#92758 merged
May 7, 2025 -
[XLA:GPU] (NFC) Simplify logic in CreateMLIRModule.
#92783 merged
May 7, 2025 -
PR #25919: feat: add runtime_version field to target_config
#92711 merged
May 7, 2025 -
Record manual axes instead of generating mhlo.sharding in stablehlo/shard_map_export.
#92533 merged
May 7, 2025 -
Automated Code Change
#92863 merged
May 7, 2025 -
Automated Code Change
#92857 merged
May 7, 2025 -
[XLA:GPU][Emitters] Correctly handle allocate_shared for s4 type.
#92879 merged
May 7, 2025 -
Automated Code Change
#92860 merged
May 7, 2025 -
[XLA:CPU] Migrate xla_jit_compiled_cpu_function to thunk execution
#90379 merged
May 7, 2025 -
PR #25441: [XLA:GPU] add cuDNN Paged attention support
#92703 merged
May 7, 2025 -
#sdy no need to add (and remove) a fake result to a callback custom call with no results.
#92794 merged
May 7, 2025 -
PR #26101: Allow bitcasts in dynamic memcpy fusions.
#92785 merged
May 7, 2025 -
PR #26111: [NFC] Refactor ResolveFunctionalDependencyOnInductionVariable.
#92866 merged
May 7, 2025 -
[XLA:GPU] Add simple triton support test for tuple ops
#91496 merged
May 7, 2025 -
[XLA:CPU] Expose interface for rendering execution graphs via dot API
#92618 merged
May 7, 2025 -
Rollback oneDNN v3.7 + ACL v24.12 update in aarch64 build.
#92809 merged
May 7, 2025 -
Use `shared_ptr` instead of `unique_ptr` for executable references
#92826 merged
May 7, 2025 -
PR #25959: [ROCm] fixed missing headers + deps
#92702 merged
May 7, 2025 -
[xla] Add a name to execution graph operations
#92687 merged
May 7, 2025 -
Temporarily add `UBDialect` as a dependent dialect in `SdyRoundTripStablehloToHloToStablehloPass`.
#92872 merged
May 7, 2025 -
[IFRT] Exclude sdy dialect from the verification conducted by the IFRT SPMD expandable verification pass
#92770 merged
May 7, 2025 -
[XLA:GPU] Integrate perf tables to SoL estimator.
#92412 merged
May 7, 2025 -
Bump the Shardy XLA pin.
#92869 merged
May 7, 2025 -
Introduce a type alias for array references
#92800 merged
May 7, 2025 -
Automated Code Change
#92862 merged
May 7, 2025 -
fix(WeakTrackableReference): correct weak reference type handling
#92336 merged
May 7, 2025 -
Temporarily roll back changes for new LLVM version
#92828 merged
May 7, 2025 -
Fork remaining passes to tf_passes and related .mlir tests in with new namespace
#92541 merged
May 7, 2025 -
Use `LoadedExecutableRef` instead of `std::unique_ptr<LoadedExecutable>`
#92815 merged
May 6, 2025 -
Replace usage of scipy.misc.derivative in TensorFlow.
#92822 merged
May 6, 2025 -
Add two optimization patterns replacing `x + (y - y)` or `(y - y) + x` with `x`.
#92811 merged
May 6, 2025 -
Integrate StableHLO at openxla/stablehlo@63246aea
#92446 merged
May 6, 2025 -
[XLA] Remove UB dialect from XLA dependencies.
#92722 merged
May 6, 2025 -
[xla:gpu] Detect empty traced CUDA graphs
#92806 merged
May 6, 2025 -
Add Fprint128ToBytes utility function.
#92749 merged
May 6, 2025 -
Reverts 38a0e6bdaea46322fbeffb0902a8136fe8bc9130
#92803 merged
May 6, 2025 -
Remove tf/compiler/mlir/lite:tensorflow_lite from tf/compiler/mlir/quantization/tensorflow:passes
#92416 merged
May 6, 2025 -
Move BufferAllocationSliceProto to xla/service/buffer_assignment.proto
#92786 merged
May 6, 2025 -
Precompute set of buffers allocated/released per instruction.
#92739 merged
May 6, 2025 -
Add swizzle_mode to TMA attribute
#92790 merged
May 6, 2025 -
[xla:ffi] Fix typo in struct size used during DeviceOrdinal decoding.
#92797 merged
May 6, 2025 -
PR #26100: Clean up special cases for AsyncStart in HloComputation.
#92780 merged
May 6, 2025 -
Add a bazel config setting to indicate whether the code is built in a CI.
#92516 merged
May 6, 2025 -
Introduce aliases for executable references
#92792 merged
May 6, 2025 -
PR #22788: Implement GetCompiledMemoryStats for GPU AOT executables
#92796 merged
May 6, 2025 -
Integrate support for tensordesc structs and reenable descriptor tests
#92411 merged
May 6, 2025 -
Add proto serialization for the SequentialThunk
#92772 merged
May 6, 2025 -
[XLA:GPU] Add CuBLAS autotuner backend.
#92269 merged
May 6, 2025 -
Rewrite the argument filtering code to use argument masks
#92410 merged
May 6, 2025 -
[XLA:GPU]: Remove GpuPerformanceModelOptions
#92717 merged
May 6, 2025 -
#sdy Handle the case where an output of a `sdy.manual_computation` has a dimension of size zero.
#92781 merged
May 6, 2025 -
Remove unused dependency on flatbuffer_import
#92728 merged
May 6, 2025 -
[XLA:GPU] Add support for `.txtpb` files.
#92743 merged
May 6, 2025 -
[XLA:GPU] Remove unused `trace` argument from CreateMLIRModule.
#92779 merged
May 6, 2025 -
Drop unneeded dep from dialect
#92606 merged
May 6, 2025 -
PR #26046: Generalize dynamic memcpy support.
#92713 merged
May 6, 2025 -
Automated Code Change
#92691 merged
May 6, 2025 -
[xla:gpu][triton] Update TMA lowering in XLA to use new api
#92716 merged
May 6, 2025 -
[XLA:GPU] Collect matmul perf tables on forge.
#92712 merged
May 6, 2025 -
[XLA] Disable dynamic-slice async conversion until we correctly account for operands.
#92757 merged
May 6, 2025 -
Automated Code Change
#92647 merged
May 6, 2025 -
Remove unused header and dep.
#92707 merged
May 6, 2025 -
[XLA:GPU] Remove xla_gpu_unsafe_pipelined_loop_annotator flag
#89920 merged
May 6, 2025 -
[XLA:GPU] Remove special case for send/recv in async events mapping
#92740 merged
May 6, 2025 -
Fix missing dep.
#92727 merged
May 6, 2025 -
[XLA:GPU] Remove brittle pipeline parallelism detection in the SPMD partitioner pass
#90065 merged
May 6, 2025 -
Support platform / memory space allocator in TfrtGpuClient.
#92750 merged
May 6, 2025 -
[xla:pjrt] Add micro benchmarks for StreamExecutorGpuClient
#92738 merged
May 6, 2025 -
Initialize fp16_to_fp32 with identity map
#92633 merged
May 6, 2025 -
[XLA:GPU] Move all_reduce_splitter to collectives folder
#92511 merged
May 6, 2025 -
Add tfl.EmbeddingLookup DRQ
#92742 merged
May 6, 2025 -
[XLA:GPU] Move all_reduce_decomposer to collectives folder
#92512 merged
May 6, 2025 -
[XLA:GPU] Move all_reduce_blueconnect to collectives folder
#92509 merged
May 5, 2025 -
Updates IOPDDL conversion to shift edge costs by a constant amount if negative values are present.
#92726 merged
May 5, 2025 -
Implement TransferToInfeed and TransferFromOutfeed in pjrt async gpu client.
#92588 merged
May 5, 2025 -
Internal only changes
#92733 merged
May 5, 2025 -
[XLA:GPU] Move all_gather_dynamic_slice_simplifier to collectives folder
#92510 merged
May 5, 2025 -
Automated Code Change
#92719 merged
May 5, 2025 -
[XLA:GPU] Move all_gather_optimizer to collectives folder
#92508 merged
May 5, 2025 -
Refactor HloRunnerPjRt to distinguish loaded and unloaded PjRtExecutables.
#92630 merged
May 5, 2025 -
[IFRT] Switch `std::shared_ptr<xla::ifrt::Sharding>` to `xla::ifrt::ShardingRef`
#92627 merged
May 5, 2025 -
Prepare VarHandle ops too
#92721 merged
May 5, 2025 -
Avoid host scheduling dependencies between two GPU CUDA requests
#91890 merged
May 5, 2025 -
Implement Autotuner Backend interface for Triton.
#92710 merged
May 5, 2025 -
[XLA:GPU] update autotune db entry
#92715 merged
May 5, 2025 -
[XLA:GPU] Add simple triton support test for map op
#91655 merged
May 5, 2025 -
Automated Code Change
#92645 merged
May 5, 2025 -
Remove more if_gpu_is_configured usages from service/gpu/BUILD
#92486 merged
May 5, 2025 -
[XLA:GPU] Check whether the propagated tile offsets can be used.
#90805 merged
May 5, 2025 -
PR #25953: [XLA:FFI][BugFix] Add Token Args and Rets for typed FFI on CPU backend
#92709 merged
May 5, 2025 -
PR #25916: Added SafeStaticInit for deadlock free safe static initialization
#92708 merged
May 5, 2025 -
PR #25965: [NVIDIA GPU] Fix fabric info test
#92607 merged
May 5, 2025 -
PR #26003: Run fewer analyses in HloEvaluator.
#92616 merged
May 5, 2025 -
PR #26001: [NVIDIA GPU] Change UB register cache to include size in the key
#92705 merged
May 5, 2025 -
PR #24279: Handle FP8 bitcasting when lowering tensors
#92611 merged
May 5, 2025 -
PR #26012: Convert computations to MakeNonfusionComputations in async wrappers
#92704 merged
May 5, 2025 -
[XLA:Python] [JAX] Move HloPass bindings to XLA/Python.
#92688 merged
May 5, 2025 -
Automated Code Change
#92676 merged
May 4, 2025
228 Pull requests opened by 7 people
-
Get rid of `HloModuleProto` from `CompiledMemoryStats`
#92690 opened
May 4, 2025 -
Automated Code Change
#92692 opened
May 5, 2025 -
Automated Code Change
#92693 opened
May 5, 2025 -
Automated Code Change
#92694 opened
May 5, 2025 -
Automated Code Change
#92695 opened
May 5, 2025 -
Automated Code Change
#92696 opened
May 5, 2025 -
Automated Code Change
#92697 opened
May 5, 2025 -
Automated Code Change
#92698 opened
May 5, 2025 -
Automated Code Change
#92699 opened
May 5, 2025 -
Automated Code Change
#92700 opened
May 5, 2025 -
Automated Code Change
#92701 opened
May 5, 2025 -
TfLite elementwise_ops add type support (#104)
#92706 opened
May 5, 2025 -
TfLite exp missing datatype support (#69) (#121)
#92714 opened
May 5, 2025 -
Explicitly include `errors.h` in `status_macros.h`. We need `tsl::error::INTERNAL` from it.
#92723 opened
May 5, 2025 -
bug fix in literal comparison in `equal` test.
#92724 opened
May 5, 2025 -
remove transitive lite dependencies from tf_passes
#92731 opened
May 5, 2025 -
[xla::PyClient] Update PyClient to use xla::ifrt::CompileAndLoad.
#92732 opened
May 5, 2025 -
PR #25635: Add parsing debug options support
#92734 opened
May 5, 2025 -
[XLA:GPU] Return an error if `GetSchedulerMemoryLimit` fails.
#92735 opened
May 5, 2025 -
Collect execution time for mlrt.
#92744 opened
May 5, 2025 -
[Cleanup] Use CHECK_NOTNULL and CHECK_OK
#92745 opened
May 5, 2025 -
Fix clang-format and BUILD dependency issues
#92751 opened
May 5, 2025 -
Update GetLiveNodes API to be a bit more robust to worker restarts.
#92752 opened
May 5, 2025 -
Integrate LLVM at llvm/llvm-project@0009a1783490
#92754 opened
May 5, 2025 -
Automated Code Change
#92756 opened
May 5, 2025 -
[JAX] Make fully replicated sharding to avoid materializing the same host buffers
#92759 opened
May 6, 2025 -
Automated Code Change
#92760 opened
May 6, 2025 -
Automated Code Change
#92761 opened
May 6, 2025 -
Automated Code Change
#92762 opened
May 6, 2025 -
Automated Code Change
#92764 opened
May 6, 2025 -
Automated Code Change
#92765 opened
May 6, 2025 -
Fix missing dep.
#92766 opened
May 6, 2025 -
Automated Code Change
#92767 opened
May 6, 2025 -
Delete constructor for tf_allocator_adapter.
#92768 opened
May 6, 2025 -
Automated Code Change
#92773 opened
May 6, 2025 -
Set wheel CUDA versions to be minor-version lower bound.
#92774 opened
May 6, 2025 -
Automated Code Change
#92775 opened
May 6, 2025 -
[XLA:GPU][Emitters] Use new heuristic for transpose emitter
#92776 opened
May 6, 2025 -
PR #26100: Clean up special cases for AsyncStart in HloComputation.
#92778 opened
May 6, 2025 -
[xla:gpu] Fork existing fusion_emitter_int4_device_test.cc and make it use generic emitter.
#92801 opened
May 6, 2025 -
Add [de]serialization for `GemmConfig`.
#92802 opened
May 6, 2025 -
Set remaining profile options and perform validation on the advanced_configuration keys.
#92804 opened
May 6, 2025 -
Add [de]serialization for `BufferAllocationSlice`.
#92805 opened
May 6, 2025 -
[XLA:Collective] Cache the results from GetParticipatingFlattenedIdGroups().
#92813 opened
May 6, 2025 -
Add specialization for `PrintToStream` in `TF_RET_CHECK`'s additional output.
#92814 opened
May 6, 2025 -
Reverts 6932184196b02be51a5be4ca1090a422f897dc4e
#92817 opened
May 6, 2025 -
Integrate LLVM at llvm/llvm-project@3e235a7c601d
#92818 opened
May 6, 2025 -
Make Shape reference-counted with copy-on-write.
#92819 opened
May 6, 2025 -
Replace `std::shared_ptr<xla::ifrt::LoadedExecutable>` with `xla::ifrt::LoadedExecutableRef`
#92827 opened
May 7, 2025 -
Prevent MSA from creating a worst case unbalanced buffer interval tree when allocating scoped vmem.
#92829 opened
May 7, 2025 -
Use utility functions to check number of inputs and outputs.
#92832 opened
May 7, 2025 -
Automated Code Change
#92833 opened
May 7, 2025 -
Automated Code Change
#92834 opened
May 7, 2025 -
Automated Code Change
#92835 opened
May 7, 2025 -
Automated Code Change
#92836 opened
May 7, 2025 -
Automated Code Change
#92837 opened
May 7, 2025 -
Automated Code Change
#92838 opened
May 7, 2025 -
Automated Code Change
#92839 opened
May 7, 2025 -
Automated Code Change
#92840 opened
May 7, 2025 -
Automated Code Change
#92841 opened
May 7, 2025 -
Automated Code Change
#92842 opened
May 7, 2025 -
Automated Code Change
#92843 opened
May 7, 2025 -
Automated Code Change
#92844 opened
May 7, 2025 -
Automated Code Change
#92845 opened
May 7, 2025 -
Automated Code Change
#92846 opened
May 7, 2025 -
Automated Code Change
#92847 opened
May 7, 2025 -
Automated Code Change
#92848 opened
May 7, 2025 -
Automated Code Change
#92849 opened
May 7, 2025 -
Automated Code Change
#92850 opened
May 7, 2025 -
Automated Code Change
#92851 opened
May 7, 2025 -
Automated Code Change
#92852 opened
May 7, 2025 -
Automated Code Change
#92853 opened
May 7, 2025 -
Automated Code Change
#92854 opened
May 7, 2025 -
Automated Code Change
#92855 opened
May 7, 2025 -
[XLA:GPU] Add simple triton support test for topK op
#92856 opened
May 7, 2025 -
Automated Code Change
#92858 opened
May 7, 2025 -
Automated Code Change
#92859 opened
May 7, 2025 -
Automated Code Change
#92861 opened
May 7, 2025 -
Automated Code Change
#92864 opened
May 7, 2025 -
Automated Code Change
#92865 opened
May 7, 2025 -
[xla:gpu] add extract_tma_info pass
#92867 opened
May 7, 2025 -
Update TmaDescriptor creation to use triton's internal swizzling decision
#92868 opened
May 7, 2025 -
Automated Code Change
#92870 opened
May 7, 2025 -
Automated Code Change
#92871 opened
May 7, 2025 -
Reverts f6269e13f1d709b4fe46079e36f1407f65075271
#92874 opened
May 7, 2025 -
Reenable tma_to_llvm.mlir test
#92875 opened
May 7, 2025 -
feat: add datatype support for add, ceil, mul, range, sign, sub
#92880 opened
May 7, 2025 -
[XLA:CPU][roll-forward] Internal fixes.
#92894 opened
May 7, 2025 -
fork quantize_passes to use tf_passes
#92895 opened
May 7, 2025 -
Integrate LLVM at llvm/llvm-project@ed8d4fe0b1d8
#92896 opened
May 7, 2025 -
Custom NVPTX backend.
#92900 opened
May 7, 2025 -
Integrate LLVM at llvm/llvm-project@2d287f51eff2
#92901 opened
May 7, 2025 -
Introduce the `BUFFER` primitive type and update the `Shape` class to support it.
#92906 opened
May 7, 2025 -
[XLA:benchmarks] Add workflow files to generate GHA input matrices and a presubmit workflow
#92912 opened
May 7, 2025 -
Add traceme while borrowing streams
#92917 opened
May 7, 2025 -
Implement default `layout` in IfrtProxyArray
#92923 opened
May 8, 2025 -
Automated Code Change
#92925 opened
May 8, 2025 -
Automated Code Change
#92926 opened
May 8, 2025 -
Automated Code Change
#92927 opened
May 8, 2025 -
Automated Code Change
#92928 opened
May 8, 2025 -
Automated Code Change
#92929 opened
May 8, 2025 -
Automated Code Change
#92930 opened
May 8, 2025 -
Automated Code Change
#92932 opened
May 8, 2025 -
Automated Code Change
#92933 opened
May 8, 2025 -
Automated Code Change
#92934 opened
May 8, 2025 -
Automated Code Change
#92935 opened
May 8, 2025 -
Automated Code Change
#92936 opened
May 8, 2025 -
Automated Code Change
#92937 opened
May 8, 2025 -
Automated Code Change
#92947 opened
May 8, 2025 -
Automated Code Change
#92949 opened
May 8, 2025 -
Increase random UUID space in the fingerprint proto.
#92952 opened
May 8, 2025 -
Automated Code Change
#92954 opened
May 8, 2025 -
Handle non-normalized layouts for TMA.
#92957 opened
May 8, 2025 -
[xla:gpu] Use enum for swizzle mode in tma_descriptor attributes.
#92958 opened
May 8, 2025 -
[XLA:GPU] Add option to always fold constants
#92962 opened
May 8, 2025 -
fork stablehlo-quant-opt to tf-stablehlo-quant-opt without lite deps
#92972 opened
May 8, 2025 -
[XLA:GPU] Add triton support test for triangular-solve op
#92973 opened
May 8, 2025 -
[ifrt] Refactor away from deprecated constructors
#92978 opened
May 8, 2025 -
Replace calls to <ctype.h> functions with their absl:: equivalents.
#92981 opened
May 8, 2025 -
Convert remaining `native.py_*` usages to load from rules_python
#92984 opened
May 8, 2025 -
Remove TfrtCpuClient::BufferFromHostLiteral in favor of CommonPjRtClient::BufferFromHostLiteral.
#92985 opened
May 8, 2025 -
Make HloInstruction::name_ only take 8 bytes inline in the object
#92986 opened
May 8, 2025 -
[tosa] Fix `obj.isa` -> `mlir::isa` in an assertion
#92987 opened
May 8, 2025 -
Consistently apply preprocessing in HloRunnerAgnosticTestBase + friends.
#92988 opened
May 8, 2025 -
Integrate LLVM at llvm/llvm-project@741fef3a4453
#92993 opened
May 9, 2025 -
Move CreateViewOfDeviceBuffer and BufferFromHostBuffer into CommonPjRtClient
#92994 opened
May 9, 2025 -
Remove support for dimension level type.
#92995 opened
May 9, 2025 -
Automated Code Change
#92996 opened
May 9, 2025 -
Automated Code Change
#92997 opened
May 9, 2025 -
Automated Code Change
#92998 opened
May 9, 2025 -
Automated Code Change
#92999 opened
May 9, 2025 -
Automated Code Change
#93000 opened
May 9, 2025 -
Automated Code Change
#93002 opened
May 9, 2025 -
Automated Code Change
#93004 opened
May 9, 2025 -
Automated Code Change
#93005 opened
May 9, 2025 -
Automated Code Change
#93006 opened
May 9, 2025 -
Automated Code Change
#93007 opened
May 9, 2025 -
Automated Code Change
#93009 opened
May 9, 2025 -
Make the recently-added RegisterPluggableDevicePlugin type-safe.
#93010 opened
May 9, 2025 -
PR #26255: [Nvidia GPU] Nvshmem communicator API
#93011 opened
May 9, 2025 -
Remove ConvertFakeQuantWithMinMaxVarsOp from TFLite lowering patterns.
#93012 opened
May 9, 2025 -
Automated Code Change
#93013 opened
May 9, 2025 -
Automated Code Change
#93014 opened
May 9, 2025 -
PR #25601: Fix NCCL user buffer allocation error message and unit tests for kCollective
#93015 opened
May 9, 2025 -
[XLA:CPU] Graph visualization called only for executor owned by CpuExecutable
#93016 opened
May 9, 2025 -
PR #26255: [Nvidia GPU] Nvshmem communicator API
#93017 opened
May 9, 2025 -
[XLA:CPU][XLA:GPU] Implement Shared `ForAll` op and use it in GPU loop fusion
#93020 opened
May 9, 2025 -
[XLA:CPU] Implement lowering of shared `forall` op
#93021 opened
May 9, 2025 -
[XLA:GPU] Add triton support test for convolution op
#93022 opened
May 9, 2025 -
[XLA:CPU] Add op type id to ExecutionGraph::Operation
#93023 opened
May 9, 2025 -
[XLA][graphviz] Graph visualization from a span of ops
#93026 opened
May 9, 2025 -
[XLA:CPU][XLA:GPU] Implement shared workgroup id op
#93027 opened
May 9, 2025 -
[XLA] Move execution graph renderer to execution_graph
#93028 opened
May 9, 2025 -
internal BUILD rule visibility
#93031 opened
May 9, 2025 -
Automated Code Change
#93032 opened
May 9, 2025 -
[XLA:GPU] Add custom kernel autotuning backend.
#93034 opened
May 9, 2025 -
[XLA:GPU] Only support cuBLAS custom calls in cuBLAS autotuning backend.
#93036 opened
May 9, 2025 -
Manually migrate deprecated references to `strings::StrCat` and `strings::StrAppend`.
#93039 opened
May 9, 2025 -
Add proto serialization for ShapeIndex
#93040 opened
May 9, 2025 -
Push down the logic that reads the default memory fraction for GPUs from `XLA_CLIENT_MEM_FRACTION`
#93041 opened
May 9, 2025 -
[XLA:CPU] Add lowering for `workgroup_id` to `thread_id`
#93042 opened
May 9, 2025 -
[XLA:CPU] Add ability to get non-x thread dimension in cpu fusion dialect.
#93043 opened
May 9, 2025 -
update sharding settings to remove empty shards in execution
#93045 opened
May 9, 2025 -
Use the shell rules https://github.com/bazelbuild/rules_shell.
#93047 opened
May 9, 2025 -
#sdy save shardings of the arguments on `CaseOp` during MHLO->HLO translation.
#93048 opened
May 9, 2025 -
Use low latency thread pool for async PjRT
#93052 opened
May 9, 2025 -
Don't print a warning when a dependency repo is already defined.
#93055 opened
May 9, 2025 -
[XLA:CPU] Expose an interface to get nested thunks
#93056 opened
May 9, 2025 -
[XLA][graphviz] Visualize nested thunks as clusters
#93057 opened
May 9, 2025 -
Fix cost analysis on for output byte accessed when result is tuple
#93059 opened
May 9, 2025 -
Allow simplify_ici_dummy_variables to be triggered with TPUExecuteAndUpdateVariables nodes.
#93060 opened
May 9, 2025 -
Upgrade bazel_skylib to 1.7.1
#93061 opened
May 9, 2025 -
Log before and after calling test subprocess.
#93062 opened
May 9, 2025 -
Internal visibility changes
#93064 opened
May 9, 2025 -
Remove the semaphore in ExecuteHelper
#93066 opened
May 9, 2025 -
Automated Code Change
#93069 opened
May 10, 2025 -
Automated Code Change
#93070 opened
May 10, 2025 -
Automated Code Change
#93073 opened
May 10, 2025 -
Automated Code Change
#93074 opened
May 10, 2025 -
Automated Code Change
#93075 opened
May 10, 2025 -
Automated Code Change
#93076 opened
May 10, 2025 -
Automated Code Change
#93077 opened
May 10, 2025 -
Automated Code Change
#93078 opened
May 10, 2025 -
Cache invalidation for XProf
#93080 opened
May 10, 2025 -
Automated Code Change
#93081 opened
May 10, 2025 -
Automated Code Change
#93082 opened
May 10, 2025 -
Automated Code Change
#93083 opened
May 10, 2025 -
Automated Code Change
#93084 opened
May 10, 2025 -
Automated Code Change
#93085 opened
May 10, 2025 -
Automated Code Change
#93086 opened
May 10, 2025 -
Automated Code Change
#93087 opened
May 10, 2025 -
Automated Code Change
#93088 opened
May 10, 2025 -
Automated Code Change
#93090 opened
May 10, 2025 -
Automated Code Change
#93091 opened
May 10, 2025 -
Automated Code Change
#93093 opened
May 10, 2025 -
Update activations.py
#93094 opened
May 10, 2025 -
[XLA] Simplify CheckInplaceCollectivePermute.
#93096 opened
May 10, 2025 -
Reverts fdde76d61ee110eb0e1dad27e9af5589bda9ef11
#93097 opened
May 10, 2025 -
[XLA] Remove ShapeUtil::GetSubshapeOneIndex().
#93098 opened
May 10, 2025 -
Use a tile assignment to store device groups in grouped sharding.
#93103 opened
May 10, 2025 -
Fix a bug where TFRT GPU does not handle token literals correctly
#93104 opened
May 10, 2025 -
Automated Code Change
#93106 opened
May 11, 2025 -
Automated Code Change
#93107 opened
May 11, 2025 -
Automated Code Change
#93108 opened
May 11, 2025 -
Automated Code Change
#93109 opened
May 11, 2025 -
Automated Code Change
#93110 opened
May 11, 2025 -
Automated Code Change
#93111 opened
May 11, 2025 -
Automated Code Change
#93112 opened
May 11, 2025 -
Automated Code Change
#93113 opened
May 11, 2025 -
Automated Code Change
#93114 opened
May 11, 2025 -
Automated Code Change
#93115 opened
May 11, 2025 -
Automated Code Change
#93116 opened
May 11, 2025 -
Automated Code Change
#93117 opened
May 11, 2025 -
Automated Code Change
#93118 opened
May 11, 2025 -
Automated Code Change
#93119 opened
May 11, 2025 -
Automated Code Change
#93120 opened
May 11, 2025 -
Automated Code Change
#93121 opened
May 11, 2025 -
Automated Code Change
#93122 opened
May 11, 2025 -
Automated Code Change
#93123 opened
May 11, 2025 -
Remove Tfrt from PJRT GPU buffer type names
#93124 opened
May 11, 2025 -
Add complex64 and complex128 support to tf.truncatediv
#93125 opened
May 11, 2025 -
Automated Code Change
#93126 opened
May 11, 2025 -
Automated Code Change
#93127 opened
May 11, 2025 -
Automated Code Change
#93128 opened
May 11, 2025 -
Automated Code Change
#93129 opened
May 11, 2025
17 Issues closed by 5 people
-
The pip instructions to download and install tensorflow for macOS don't work
#92062 closed
May 11, 2025 -
tensorflow dll issue
#93008 closed
May 10, 2025 -
DLL load failed while importing _pywrap_tensorflow_internal
#92941 closed
May 10, 2025 -
Grateful for Your Work — Open to Collaborate
#93092 closed
May 10, 2025 -
`tf.math.imag` doesn't support float and double
#92876 closed
May 10, 2025 -
warning in official
#91733 closed
May 9, 2025 -
Compiling TFLite introductory example fails due to missing headers
#91782 closed
May 9, 2025 -
check
#92920 closed
May 8, 2025 -
Install TensorFlow with pip for macOS doesn't work on M4 MacBook Pro Apple Silicon
#91681 closed
May 7, 2025 -
Tensorflow model.save fails with AttributeError: 'Variable' object has no attribute 'read_value'
#91686 closed
May 7, 2025 -
Loading tensorflow DLL error in Spyder other python ok
#91709 closed
May 7, 2025 -
Mac Air M2, TfLiteGpuDelegate, Cpp, C++ Building from source
#86075 closed
May 6, 2025 -
FP32 gives good results but int8 predicts only single class
#89438 closed
May 6, 2025 -
Tensorflow Build Error
#91558 closed
May 6, 2025 -
multiprocessing incompatibility
#91647 closed
May 6, 2025 -
MemoryError during "import tensorflow" with tensorflow-cpu==2.19.0
#89268 closed
May 5, 2025
10 Issues opened by 7 people
-
clang: error: unknown argument: '-fno-canonical-system-headers'
#93049 opened
May 9, 2025 -
Inconsistencies between the implementation and documentation of `tf.strided_slice`
#92955 opened
May 8, 2025 -
Build TensorFlow Lite C library Error: no such instruction: 'vdpbf16ps zmm7, zmm2, zmm10'
#92945 opened
May 8, 2025 -
[iOS 18] EXC_BAD_ACCESS when initializing TFLite model on device running iOS 18.4.1
#92924 opened
May 8, 2025 -
`tf.truncatediv` doesn't work on complex64 and complex128
#92922 opened
May 8, 2025 -
`tf.math.reciprocal` doesn't work on int8, int16 and int32
#92921 opened
May 8, 2025 -
`tf.math.igamma` doesn't work when dtype is bfloat16 or half
#92873 opened
May 7, 2025 -
Different inference results on CPU vs GPU for simple model on macOS M1
#92686 opened
May 4, 2025
59 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Problem when using tf-to-tosa-pipeline
#92660 commented on
May 5, 2025 • 0 new comments -
Error when converting tensorflow model in mlir
#92659 commented on
May 5, 2025 • 0 new comments -
GPU kernel for `tf.linalg.eig` / `tf.linalg.eigvals`
#92194 commented on
May 5, 2025 • 0 new comments -
New version of nVidia Windows GPU driver (576+) causing TF in WSL process to crash.
#92251 commented on
May 5, 2025 • 0 new comments -
TensorFlow on RTX 5090
#89272 commented on
May 5, 2025 • 0 new comments -
The default value for skip_gradients_aggregation (False) of optimizers prevents freeing memory
#92094 commented on
May 5, 2025 • 0 new comments -
Impossible to free GPU memory used by rank 0 tensors
#92103 commented on
May 5, 2025 • 0 new comments -
How to setup tensorflow environment in google colab for training ai models using tensorflow object detection APIs?
#91738 commented on
May 6, 2025 • 0 new comments -
[TFLite] GPU delegation issues with Vision Transformer (ViT-base) in TFLite: dynamic shape & op support problems
#92098 commented on
May 6, 2025 • 0 new comments -
Muting Tensorflow Lite logs
#92216 commented on
May 6, 2025 • 0 new comments -
model load problem
#92044 commented on
May 7, 2025 • 0 new comments -
Compiling Tensorflow v2.19.0 with mkl_aarch64 fails.
#92279 commented on
May 7, 2025 • 0 new comments -
GPU Illegal memory access in kernel DenseToCSRSparseMatrix
#92413 commented on
May 7, 2025 • 0 new comments -
Tensorflow for Arch Linux
#92550 commented on
May 7, 2025 • 0 new comments -
No way to use tensorflow with cuda on windows. cudaGetDevice() failed.
#92523 commented on
May 10, 2025 • 0 new comments -
"Given shapes (...) are not broadcastable" after conversion onnx->tf->tflite
#92037 commented on
May 10, 2025 • 0 new comments -
Setting the protobuf version at installation with bazel
#92610 commented on
May 10, 2025 • 0 new comments -
It doesn't support on python3.13
#78774 commented on
May 11, 2025 • 0 new comments -
Add ComplexOp as a builtin to TFLite
#61359 commented on
May 5, 2025 • 0 new comments -
Support for arbitrary tensor sizes in `tfl.strided_slice` lowering
#61424 commented on
May 5, 2025 • 0 new comments -
Build kernel for all numbertypes for ApproximateEqual Op
#64587 commented on
May 5, 2025 • 0 new comments -
Fix checkfail in MatrixSetDiagV3
#65702 commented on
May 5, 2025 • 0 new comments -
iwyu: missing includes in xla
#66450 commented on
May 5, 2025 • 0 new comments -
Add GPU configuration options to TensorFlow Go bindings
#81989 commented on
May 7, 2025 • 0 new comments -
Remove Transfer-Encoding: identity considered to be illegal
#83082 commented on
May 5, 2025 • 0 new comments -
[oneDNN][CPU] fuse a matmul pattern
#86172 commented on
May 5, 2025 • 0 new comments -
fix(kernels): Handle empty values with non-empty row splits in RaggedTensorToTensor
#86349 commented on
May 5, 2025 • 0 new comments -
Fix compile error in tensorflow/python/tfcompile_wrapper.cc on s390x
#87676 commented on
May 5, 2025 • 0 new comments -
TfLite. Fix of issue 61269
#87946 commented on
May 10, 2025 • 0 new comments -
TfLite. Fix of issue 54269
#88020 commented on
May 11, 2025 • 0 new comments -
[XLA:CPU][tfcompile] Use thunk runtime for tfcompiled models.
#88552 commented on
May 6, 2025 • 0 new comments -
Fix: Ensure boolean_mask_v2() only accepts boolean dtype for mask
#89370 commented on
May 5, 2025 • 0 new comments -
Move duplicate CUDA/XLA registration logs from INFO to VLOG
#89808 commented on
May 5, 2025 • 0 new comments -
Adapt commit 5467ee9 from XLA to TensorFlow
#90563 commented on
May 6, 2025 • 0 new comments -
Create a PjRt protos dir.
#90719 commented on
May 9, 2025 • 0 new comments -
adam opt .
#90795 commented on
May 5, 2025 • 0 new comments -
#sdy Properly handle token types in JAX and `ManualComputationOp`.
#91048 commented on
May 7, 2025 • 0 new comments -
Enable Android XNNPACK delegate num_threads with Interpreter.Options.…
#91181 commented on
May 6, 2025 • 0 new comments -
experimental test for dup registration bug
#91758 commented on
May 7, 2025 • 0 new comments -
Major deps update:
#92241 commented on
May 11, 2025 • 0 new comments -
[XLA:ReplicaGroupV2] Propagate ReplicaGroupV2 to more places. (NFC)
#92319 commented on
May 7, 2025 • 0 new comments -
[XLA:GPU][Emitters] Scan through the IR to compute shared memory usage.
#92427 commented on
May 5, 2025 • 0 new comments -
[XLA] Add F8E8M0FNU to IsF8Type
#92441 commented on
May 9, 2025 • 0 new comments -
feat: add datatype support for tfl.transpose
#92472 commented on
May 6, 2025 • 0 new comments -
Do not create BFCAllocator if not needed
#92483 commented on
May 5, 2025 • 0 new comments -
Update Riegeli dependency to the latest commit.
#92502 commented on
May 10, 2025 • 0 new comments -
Fixes #58106: Add shape validation to AvgPool3DGrad input to prevent crash
#92521 commented on
May 8, 2025 • 0 new comments -
Add benchmark for Shape assignment.
#92548 commented on
May 8, 2025 • 0 new comments -
Switch uses of `shape.DebugString()` to `shape.ToString()` or `shape.ToProto().DebugString()`.
#92572 commented on
May 5, 2025 • 0 new comments -
PR #23884: [NVIDIA] Fix a cast error in checking overlap limit function in LHS
#92573 commented on
May 9, 2025 • 0 new comments -
Composite,CustomCall,DotGeneral,Dot,DynamicConv,DynamicGather,DynamicIota,DynamicPad,DynamicReshape,Reshape: Direct StableHLO to HLO Translation
#92592 commented on
May 6, 2025 • 0 new comments -
Automated Code Change
#92597 commented on
May 7, 2025 • 0 new comments -
Automated Code Change
#92598 commented on
May 7, 2025 • 0 new comments -
Automated Code Change
#92600 commented on
May 9, 2025 • 0 new comments -
Fix a bug in `RandomPathGreedy` when tracking the objective value over iterations; previously, it adds `cost_delta` to `current_cost` even if a new strategy is rejected due to the peak-memory violation. Also, streamline the previous CL by combining the previous two phases (Phase I and II).
#92622 commented on
May 8, 2025 • 0 new comments -
Remove the lite/toco protobuf files out of //tensorflow/core:lib_internal_impl
#92624 commented on
May 5, 2025 • 0 new comments -
Implement Hadamard rotation reference as a custom op
#92635 commented on
May 7, 2025 • 0 new comments -
Automated Code Change
#92649 commented on
May 6, 2025 • 0 new comments -
Automated Code Change
#92681 commented on
May 7, 2025 • 0 new comments